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REMARKS 

Claims 72 to 116 are pending. Claims 72 to 88 and 98 to 116 are under consideration. 

Regarding the Claim Amendments 

The amendments to the claims are supported throughout the originally filed 
specification or were made to address informalities. In particular, the amendment to claims 
72 and 73 to recite that the antibody or functional fragment thereof "specifically binds to an 
epitope of an antigen expressed by at least one of the recited cell lines, and "wherein SAM-6 
comprising the amino acid sequences of SEQ ID NO: 1 and SEQ ID NO: 3 specifically binds 
to said epitope of the antigen expressed by at least one of the recited cell lines is supported, 
for example, at page 2, lines 9-10, which discloses "a polypeptide named SAM-6 which 
reacts with an epitope specific for neoplastic cells," and at page 29, lines 11-14, which 
discloses that "the SAM-6 monoclonal antibody or fragments thereof, that are specific for the 

antigen recognized by these polypeptides, may be used in a variety of methods " The 

amendments to claims 72 and 75 to delete adenocarcinoma of esophagus were made to 
eliminate a duplicate term and correct a typographical error. The amendment to delete "F c " 
from claims 77 and 108 was made in order to more clearly indicate that the functional 
fragments have binding activity. Thus, as the claim amendments are supported throughout 
the originally filed specification or were made to address informalities, no new matter has 
been added and entry thereof is respectfully requested. 

Regarding the Information Disclosure Statement 

Applicants note that certain references submitted in an IDS and listed on PAT- 1449 
were not considered by the Examiner since they were not in the English language or were 
apparently missing. Reference DE 4107154A1 (reference LR, PAT-1449) was listed as a 
priority application for U.S. Patent No. 5,610,280. Applicants therefore respectfully request 
consideration of US Patent No. 5,610,280 listed on the attached PTO-1449. Reference DE 
10230516A1 (reference MR, PAT-1449) was published January 1, 2004, and is therefore not 
prior art against the application. Reference DE 69212671T2 (reference NR, PAT-1449) is a 
German translation that lists European patent application 92810056.9. A copy of this 
European patent application, namely Publication 0502812A1, is submitted herewith and 
listed on the attached PTO-1449, and Applicants respectfully request consideration thereof. 
Reference DE 692291 10T2 (reference OR, PAT-1449) is a German translation that lists PCT 
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publication WO 92/1 6624. A copy of this PCT publication is submitted herewith and listed 
on the attached PTO-1449, and Applicants respectfully request consideration thereof. 
Reference DE 69527975T2 (reference PR, PAT-1449) is a German translation that lists PCT 
publication WO 96/16990. A copy of this PCT publication is submitted herewith and listed 
on the attached PTO-1449, and Applicants respectfully request consideration thereof. 
Reference EP 1 141019 Bl (reference RR, PAT-1449) is an issued EP patent in German based 
upon PCT WO 00/37489. PCT WO 00/37489 A2 and A3 were both submitted in the IDS 
(references UR and TR, respectively, PAT-1449), but apparently PCT WO 00/37489A2 is 
missing from the PTO file. A copy of PCT WO 00/37489A2 is re-submitted herewith and 
listed on the attached PTO-1449, and Applicants respectfully request consideration thereof. 
Consequently, EP 1 141019 Bl is cumulative with respect to PCT WO 00/37489A2. A copy 
of PCT WO 05/065418A2 (reference KKR, PAT-1449) is missing from the PTO file and is 
also re-submitted herewith and listed on the attached PTO-1449, and Applicants respectfully 
request consideration thereof. An English translation of the Timmerman et al reference 
(reference GGGGR, PAT-1449) is submitted herewith and listed on the attached PTO-1449, 
and Applicants respectfully request consideration thereof. 

Regarding the Objections to the Specification 

The specification stands objected to due to an alleged lack of cross-reference to the 
priority applications. The specification also stands objected to due to use of trademarks, and 
the alleged absence of x-axis and y-axis labels for Figure 10A and 10B. 

Applicants respectfully point out that a Preliminary Amendment was filed 
concurrently with the application on May 15, 2006. In this Amendment, new claims and an 
amendment to the specification to recite the priority application information (page 1 1 of the 
Preliminary Amendment filed May 15, 2006) were requested to be entered. Consequently, 
the specification should have been amended to recite the priority application information 
upon entry of the Preliminary Amendment filed May 15, 2006. Apparently the USPTO failed 
to enter the amendment reciting the priority application information in the Preliminary 
Amendment filed May 15, 2006. Applicants reiterate the request that the specification be 
amended to add the priority application information and for the sake of expediting 
prosecution by this Response request entry of the amendment to the specification to recite the 
priority application information. 
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In terms of trademarks, the specification has been amended to capitalize 
"CYTOXAN" and introduce a trademark symbol for "TAXOL." In view of the amendment, 
the objection is moot. 

In terms of the objection to Figures 10A and 10B for allegedly not describing the x- 
axis or y-axis, in view of the description of Figure 10A and 10B set forth on page 27, lines 
16-25, the x-axis of 10A and 10B refer to tumor weight and tumor volume, respectively. As 
to the y-axis, in view of the description of Figure 10A and 10B it appears that this axis 
represents a particular mouse. Thus, in view of the foregoing, the axes are adequately 
described and therefore, there is no need to amend the drawing description or Figure 10. 

REJECTION UNDER 35 U.S.C. §112. FIRST PARAGRAPH. ENABLEMENT 

The rejection of claims 72 to 79, 82 to 88 and 98 to 116 under 35 U.S.C. §112, first 
paragraph as allegedly lacking enablement is respectfully traversed. According to the Patent 
Office, allegedly it would require undue experimentation to make and use the claimed 
invention. 

The proper standard for enablement under 35 U.S.C. §1 12, is whether one skilled in 
the art could make and use the invention without undue experimentation. In this regard, "a 
considerable amount of experimentation is permissible, if it is merely routine, or if the 
specification in question provides a reasonable amount of guidance with respect to the 
direction in which the experimentation should proceed." In re Wands 858 F.2d 731, 737 
(Fed. Cir. 1988). 

Here, in view of the guidance in the specification and knowledge and skill in the art at 
the time of the invention, antibody variants having the requisite activity could be produced 
and identified using routine methods disclosed in the specification or known in the art at the 
time of the invention without undue experimentation. Consequently, one skilled in the art 
could make the claimed antibodies and functional fragments that specifically bind to the 
recited epitope of the antigen expressed by at least one of the recited cells, wherein SAM-6 
antibody comprising the amino acid sequences of SEQ ID NO: 1 and SEQ ID NO: 3 
specifically binds to the epitope of the antigen, without undue experimentation. 

First, the level of knowledge and skill in the art regarding making antibodies and 
fucntional fragments thereof was high. For example, methods of producing antibodies and 
amino acid variants without undue experimentation are disclosed in the specification (page 
31, line 20, to page 36, line 26) and were also known in the art at the time of the invention. 
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Methods of producing antibody fragments (e.g., Fv, Fab, Fab' and F(ab') 2 ) were known in the 
art and were routine at the time of the invention. Methods of identifying antibodies and 
fragments that bind antigen without undue experimentation were also known in the art and 
are taught by the specification. In particular, routine methods for detecting antibody binding 
to antigen or cell lines, as well as methods for measuring cell proliferation and apoptosis are 
disclosed in the specification (page 47, line 8 to page 49, line 10; page 55, line 26, to page 57, 
line 29; and page 66, line 10, to page 68, line 24). Thus, in view of the guidance in the 
specification and the high level of knowledge and skill in the art at the time of the invention, 
one skilled in the art could readily make antibodies and functional fragments that specifically 
bind to the recited antigen without undue experimentation. 

Claims 72 to 79, 82 to 88 and 98 to 116 are analogous to In re Wands, 858 F.2d 731 
(Fed. Cir. 1988), where the court held that screening hybridomas to determine those that 
produced monoclonal antibodies having a particular binding characteristic did not require 
undue experimentation. Likewise, for the claimed antibodies and functional fragments, undue 
experimentation would not be required to screen variants to determine those that bind given 
that 1) producing antibody variants and fragments was routine; and 2) cell binding, antibody 
competition, proliferation and apoptosis assays are disclosed in the specification and were 
also routine assays in the art at the time of the invention. 

Thus, contrary to the assertion in the Office Action (pages 8-9) where it is essentially 
stated that one skilled in the art would need to "predict which variation would not 
compromise antigen binding specificity," there is no need for the skilled artisan to "predict 
which variation would not compromise antigen binding specificity" in advance in order to 
make variants and functional fragments because making variant antibodies and functional 
fragments and screening to determine those with binding activity was routine and well 
established at the time of the invention. For example, instead of attempting to "predict which 
variation would not compromise antigen binding specificity," the skilled artisan could simply 
introduce mutations in a heavy and/or light chain variable region and then screen antibodies 
and fragments with the mutant variable region sequences for those that bind. By way of 
example, Boder et al. fProc. Nat' 1 Acad. Sci . USA 97:10701 (2000), submitted herewith as 
Exhibit A) describes directed evolution of scFv fragments using the method developed by 
Stemmer W. P., Nature 370:389 (1994). A large number of Fv sequences had improved 
binding affinity, with a dissociation rate greater than 1000 fold slower than the native non- 
mutagenized antibody (see, abstract). The authors even went so far as to remark "[fjhe 
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relative ease with which extremely high affinity has been attained in this study." (page 
10705, first column, last full paragraph) Consequently, at the time of the invention one of 
skill in the art could have readily produced variants of antibodies and functional fragments 
without having to "predict which variation would not compromise antigen binding 
specificity" without undue experimentation. 

Furthermore, case law has never required that enablement under 35 U.S.C. §1 12, first 
paragraph be satisfied by a particular methodology, i.e., a need to "predict which variation 
would not compromise antigen binding specificity" in order to make variant antibodies and 
functional fragments having binding activity. Here, in view of the guidance in the 
specification and knowledge in the art at the time of the invention, the skilled artisan could 
readily produce and identify antibody variants and functional fragments of SEQ ID NO: 1 and 
3 without predicting the effect of particular substitutions, insertions or deletions, and without 
undue experimentation. Consequently, one skilled in the art need not analyze nor need to 
predict the effect of any substitution, insertion or deletion a priori in order to satisfy 
enablement under 35 U.S.C. §112, first paragraph. 

Turning to additional grounds in the Action, contrary to the assertion that the claims 
are directed to single CDRs (pages 9-11), the claims recite antibodies and functional 
fragments that bind to an epitope of an antigen expressed by at least one the recited cell types. 
An antibody, by definition, includes three CDRs in at least a heavy chain variable region 
sequence. Consequently, as the claims are directed to antibodies that bind to an epitope of an 
antigen, which include 3 CDRs on the heavy chain variable region sequence, the ground for 
rejection relating to a single CDR is not relevant. 

In terms of the publications cited at pages 12-14 of the Action as evidence that 
conservative amino acid substitutions in the CDR/FR residues "change binding ability of 
antibodies if not substantially reduce the affinity," Applicants respectfully point out that the 
claims do not recite a particular binding affinity. Consequently, even if a change were to 
increase or decrease affinity for the epitope of the antigen expressed by at least one of the 
recited cells types, that antibody is included, provided binding is detectable by an assay. 

Furthermore, Applicants again point out that insofar as there may be publications that 
indicate that substitutions alter antibody affinity, as discussed above there is no need for one 
of skill in the art to predict which mutations would not destroy binding activity (even though, 
as discussed below, the skilled artisan would have a high degree of confidence that sequences 
amenable to substitution would be known). Rather, the skilled artisan could simply introduce 
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of a CDR or FR region of in SEQ ID NOs: 1 and 3 would likely not destroy antigen binding 
activity. Consequently, the skilled artisan would be able to predict with reasonable certainty 
variants of SEQ ID NOs: 1 and 3 that would have at least partial antigen binding activity. 

In addition, the skilled artisan knows that antibody FRs and CDRs can tolerate 
substitutions. To corroborate that substitutions within CDRs are tolerated, submitted 
herewith as Exhibit B is a publication by Kipriyanov et al. ( Protein Engineering 10:445 
(1997)). In Exhibit B the authors report that a substitution of a cysteine residue by a serine 
within CDR3 of an antibody heavy chain variable region did not have an adverse effect on 
affinity. Thus, Exhibit B affirms that CDRs can tolerate amino acid substitutions. 

To corroborate that substitutions within FRs are tolerated, submitted herewith as 
Exhibit C is a publication by Holmes et al. (J. Immunol. 167:296 (2001)). The authors of 
Exhibit C report several heavy chain variable region FR substitutions of an anti-lysozyme 
antibody did not destroy binding activity. Thus, Exhibit C affirms that FRs tolerate 
substitutions. 

To corroborate that insertions and deletions of amino acid residues in heavy and light 
chain variable regions, including CDRs, are tolerated submitted herewith as Exhibit D is a 
publication by Wilson et al. (J. Exp. Med. 187:59 (1998)). The authors of Exhibit D report a 
number of insertions and deletions of variable heavy chains that occur naturally during 
affinity maturation which are tolerated. Thus, Exhibit D affirms that heavy and light chain 
variable regions tolerate insertions and deletions. 

To further corroborate that insertions and deletions of amino acid residues in heavy 
and light chain variable regions, including CDRs, are tolerated submitted herewith as 
Exhibit E is a publication by Lantto and Ohlin (J. Biol. Chem. 277:45108 (2002)). The 
authors of Exhibit E report that single amino acid insertions or deletions of CDRs 1 and 2 of 
heavy chain variable region of an antibody were well tolerated. Thus, Exhibit E affirms that 
heavy or light chain variable CDRs tolerate insertions and deletions. 

In sum, in view of the guidance in the specification and the high level of knowledge 
and skill in the art with respect to antibody structure correlating with function at the time of 
the invention, the skilled artisan would know of general regions and particular residues that 
would be more or less amenable to substitution and could therefore predict with a high degree 
of confidence variants of SEQ ID NOs:l and 3 likely to have at least partial antigen binding 
activity. Given the large number of amino residues in variable regions including residues 
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outside of CDRs clearly many variants of SEQ ID NOs:l and 3 that would not destroy 
binding activity could be predicted with a high degree of confidence. 

In view of the foregoing, the skilled artisan could make antibody variants and 
functional fragments as claimed without undue experimentation. Consequently, claims 72 to 
79, 82 to 88 and 98 to 1 16 are adequately enabled under 35 U.S.C. § 1 12, first paragraph, and 
Applicants respectfully request that the rejection be withdrawn. 
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CONCLUSION 

Please charge any fees associated with the submission of this paper to Deposit 
Account Number 033975. The Commissioner for Patents is also authorized to credit any 
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Single-chain antibody mutants have been evolved in vitro with 
antigen-binding equilibrium dissociation constant Ka = 48 fM and 
slower dissociation kinetics (half-time > 5 days) than those for the 
streptavidin-biotin complex. These mutants possess the highest 
monovalent ligand-binding affinity yet reported for an engineered 
protein by over two orders of magnitude. Optimal kinetic screen- 
ing of randomly mutagenized libraries of 10 5 -10 7 yeast surface- 
displayed antibodies enabled a > 1,000-fold decrease in the rate of 
dissociation after four cycles of affinity mutagenesis and screening. 
The consensus mutations are generally nonconservative by com- 
parison with naturally occurring mouse Fv sequences and with 
residues that do not contact the fluorescein antigen in the wild- 
type complex. The existence of these mutants demonstrates that 
the antibody Fv architecture is not intrinsically responsible for an 
antigen-binding affinity ceiling during in vivo affinity maturation. 

Over 100 antibody therapeutics are currently in clinical trials 
for cancer, viral, autoimmune, and other diseases (1). 
High-affinity antigen recognition is at the heart of all such 
therapies and is generally attained by affinity maturation in vivo 
in the mammalian immune system. It has been observed that, 
during in vivo affinity maturation, the B cell response exhibits an 
apparent affinity ceiling near K4 > 0.1 nM, because of the 
inability to discriminate slower dissociation kinetics relative to 
intrinsic B cell receptor internalization rates (2, 3). Because 
affinity is a critical variable for therapeutic applications such as 
antibody tumor targeting (4-7), extensive efforts have been 
made to generate higher affinity antibodies by directed evolu- 
tion. The highest antibody affinities reported to date are =» 
10-20 pM (8-10). 

Antibody/hapten recognition in the 4-4-20 /fluorescein model 
system used in this study has been characterized extensively by 
structural (11, 12), thermodynamic (13), kinetic (14), computa- 
tional (15), spectroscopic (16), and mutagenic (17) means. The 
4-4-20 affinity for the f luorescein-biotin (FL-bio) hapten {Ka = 
0.7 ± 0.3 nM in PBS) is near the affinity ceiling of the tertiary 
immune response. The 4-4-20 scFv antibody fragment can be 
displayed on the surface of yeast, and mutants with increased 
affinity can be isolated by flow cytometric cell sorting (18). 
Herein, we report that optimal kinetic screening of randomly 
mutagenized libraries of 10 5 -10 7 yeast surface-displayed 4-4-20 
antibodies enabled a > 10,000-fold decrease in the rate of 
dissociation (in PBS) after four cycles of affinity mutagenesis 
and screening. These in vitro evolved single-chain antibody 
mutants have an antigen-binding equilibrium dissociation con- 
stant Ki = 48 fM and slower dissociation kinetics (half-time > 
5 days) than those for the streptavidin-biotin complex, repre- 
senting the highest reported monovalent ligand-binding affinity 
for an engineered protein by over two orders of magnitude. The 
existence of these mutants demonstrates that the antibody Fv 
architecture does not possess an intrinsic affinity ceiling for 
antigen binding. 

Materials and Methods 

Construction of Randomly Mutated scFv Libraries. A single point 
mutant of the 4-4-20 scFv (mutant 4 Ml.l) with 3-fold higher 
affinity was isolated previously from a random library generated 



in a mutator E. coli strain (18). A random scFv library based on 
this clone was created by adapting the sexual PCR method of 
Stemmer (19). The expression cassettes of 4 Ml.l and the 
unselected mutator strain-generated library were amplified by 
PCR with T3 and T7 promoter standard primers. An equimolar 
mixture of amplified 4 Ml.l and library PCR products (approx- 
imately 30 /j,g) were fragmented to <200 bp by DNase I, purified 
by gel-filtration with Centri-Sep columns (Princeton Separa- 
tions, Princeton), and recombined essentially by following the 
method of Stemmer (19) replacing Taq polymerase with Pfu 
polymerase and performing 45 cycles of PCR. Final PCR 
amplification was performed with Taq polymerase in the pres- 
ence of 2.25 mil MgCl 2 and 0.375 mM MnCl 2 to introduce 
further mutations. Primers for final amplification included se- 
quences of =100 bp 5' and 50 bp 3' flanking the scFv ORF to 
allow efficient enzymatic digestion. PCR products were purified 
by agarose gel electrophoresis and electroelution, digested with 
Nhel and Xhol, gel purified, and ligated into digested pCT302 
backbone (18). Ligation reactions were exchanged into distilled 
water, concentrated with Centricon-30 and Microcon-50 filters 
(Amicon), and transformed into DH10B Electrocompetent cells 
(Life Technologies, Grand Island, NY). Transformants were 
pooled, and aliquots were plated to determine library diversity 
before amplification and purification of plasmid DNA with the 
Qiagen (Chatsworth, CA) Maxiprep kit. Library DNA was 
transformed subsequently into yeast strain EBY100 (11) by using 
the method of Gietz and coworkers (http://tto.trends.com). 
Transformants were pooled in SD-CAA medium [6.7% (vol/vol) 
yeast nitrogen base/2% (vol/vol) glucose/5% (vol/vol) casamino 
acids/100 mM sodium phosphate, pH 6.0], and aliquots were 
plated to determine library diversity. Subsequent libraries were 
constructed with similar methods, but these methods began with 
plasmid DNA recovered from a mixed yeast population screened 
for improved binders in the prior round and the unselected 
library from the prior round. PCR-amplified DNA from these 
sources was mixed at a 9:1 molar ratio before DNase I fragmen- 
tation. Incorporation of 10% unselected library DNA and 
error-prone final PCR amplification allowed introduction of 
further mutations as well as recombination of selected mutations 
in the resulting library. Individual libraries ranged from 3 x 10 5 
to 2 X 10 7 clones. 

Screening of Yeast-Displayed scFv Libraries. Yeast cells [10 s in 2.5 
ml of TBS (20 mM Tris base/137 mM sodium chloride, pH 7.6)] 
displaying mutagenized 4-4-20 scFvs were first incubated with 1 
IxM. FL-bio to saturate surface binding, washed, and then 
incubated in 1 ;u,M 5-aminofluorescein competitor for a fixed 
period at 25°C. Competition times were calculated from a 
mathematical model (20). Labeling with streptavidin-R- 
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phycoerythrin after competition allowed isolation by flow cyto- 
metric sorting of those cells retaining the highest levels of bound 
FL-bio. Cells were costained with 12CA5 mAb (Roche Molec- 
ular Biochemicals) to normalize for cell-to-cell variation in 
expression level as described (18). Cells were sorted with gate 
settings determined by using a mathematical analysis (20); four 
rounds of sorting and regrowth were performed to isolate highly 
enriched («80%) populations of improved mutants. 

Kinetic and Sequence Analysis of Isolated Clones. Individual clones 
chosen at random were assayed for dissociation rate constants 
(&diss)- Cells were saturated with FL-bio (1.0 juM; room temper- 
ature; 30 min), washed, labeled with streptavidin-R-phyco- 
erythrin [1:100 streptavidin-phycoerythrin (PharMingen); 30 
min on ice], and incubated at 25°C with at least a 100-fold molar 
excess of 5-aminofluorescein (10 /j,M vs. ^1 nM surface- 
displayed antibody in the reaction). Aliquots were removed and 
analyzed by flow cytometry at various times; dissociation rate 
constants were determined by fitting fluorescence data to a 
first-order kinetic model. Plasmid DNA was recovered from 
yeast cells by glass bead lysis and phenolxhloroform extraction 
(21), followed by purification with the Wizard DNA Cleanup kit 
(Promega) and transformation into competent DH5a, DH0B, or 
XLl-Blue cells. scFv ORF sequences were determined by the 
University of Illinois Biotechnology Center by dideoxy termi- 
nator sequencing with an ABI Prism 377. Sequenced clones were 
chosen based on differing values of /caiss to avoid sequencing 
multiple isogenic clones. 

Fluorescence Quenching of Antigen by Soluble scFv Protein. scFvs 
were subcloned and expressed solubly in Saccharomyces cerevi- 
siae under conditions as described (22). Protein was recovered 
from raw culture supernatants by affinity chromatography with 
FITC-coupled BSA. scFv-containing fractions were purified 
further by FPLC on a Hi-TrapQ anion exchange column (Am- 
ersham Pharmacia). Protein concentration, binding constants, 
and maximal fluorescence quenching constants were deter- 
mined by direct titration with FL-bio in an SLM Aminco SPF500 
spectrofluorometer. Excitation wavelength was 492 nm, and 
emission was collected at 510 nm; all experiments were con- 
ducted in quartz cuvettes thermostated to 25°C. Dissociation 
rate constants were measured by addition of a 500-fold molar 
excess of 5-aminofluorescein (nonfluorescent competitor) at 
time 0 to an equilibrated mixture of scFv and FL-bio in PBS (150 
mM NaCl/10 mM sodium phosphate buffer, pH 8) or low salt 
buffer (LSB; 1 mM sodium phosphate, pH 8). Association rate 
constants were determined by injection of FL-bio into equili- 
brated scFv samples and fitting quenching data to a model with 
kdis S values determined from competitive dissociation experi- 
ments. Control experiments lacking 4 M5.3 scFv, FL-bio, or 
5-aminofluorescein were performed in parallel to ensure the 
absence of artifacts from adsorption, evaporation, photobleach- 
ing, or other processes over the course of the experiment. 

Results and Discussion 

The complete 4-4-20 scFv ORF was mutagenized by error-prone 
DNA shuffling (19). Fluorescently labeled clones exhibiting 
slowed antibody-hapten dissociation kinetic constants were 
identified and isolated by flow cytometry with optimal screening 
and sorting conditions calculated from a mathematical model 
(20). Up to 20 improved clones were selected randomly for 
individual measurement of the dissociation rate constant fcdiss, 
and 10 improved clones exhibiting the widest range of k^s values 
were selected for further analysis and are represented in Fig. 1A. 
The complete collection of isolated mutants was then recom- 
bined by modified DNA shuffling, together with further error- 
prone PCR mutagenesis. This cycle of mutagenesis and screen- 
ing was repeated three times, resulting in mutant scFvs with 
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Fig. 1. Dissociation kinetics of mutant scFv at 25°C. (A) Yeast displaying 
mutant scFv isolated from a random library was assayed for antigen dissoci- 
ation rate. The 4-4-20 and V H S95A values were calculated from fluorescence 
quenching data with soluble scFvs. Dissociation rates for barnase/barstar (45), 
streptavidin/biotin (46), avidin/biotin (46), and trypsin/bovine pancreatic 
trypsin inhibitor (BPTI) (47) are indicated for comparison. (S) Dissociation 
kinetics of purified, soluble 4 M5.3 and 4-4-20 scFvs were assayed by fluores- 
cence quenching. Triplicate traces for 4-4-20 scFvin PBS (150 mM NaCl/10 mM 
sodium phosphate buffer, pH 8) are shown, and 1 2 independent replicates are 
shown for 4 M5.3 in PBS or LSB. T1/2, half-time for dissociation. 
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Table 1. Binding and dissociation rate constants for soluble scFvs 



K a , fM 



scFv 


Buffer 


/c ass (X 10" 7 M-^s" 1 ) 


^ diss (X 106 S - 1 ) 


^diss/^ass 


Direct titration 


4-4-20 


PBS 


8.0 ± 3.5 


25,100 ± 500 


310,000 


700,000 ± 300,000 




LSB 


41.0 ± 8.0 


4,270 ± 100 


10,000 


ND 


4 M5.3 


PBS 


0.6 ± 0.03 


1.6 ± 0.3 


270 


ND 




LSB 


2.9 ± 0.9 


1.4 ± 0.4 


48 


ND 



ND, not determined. 



dissociation rate constants over four orders of magnitude slower 
than the ka ss of the 4-4-20 scFv (Fig. 1). As a basis for 
comparison, the dissociation rates for other high-affinity 
protein-ligand complexes are also shown in Fig. 1. The half-time 
for dissociation of the 4 M5.3 mutant is over 5 days, such that 
optimal screening of a fifth library would require kinetic com- 
petition times longer than 3 weeks (20) for each of four subse- 
quent enrichment sorting steps. 

A clone from the fourth library screen (4 M5.3) was solubly 
expressed, and the association and dissociation constants were 
measured in solution (Table 1; Fig. IB). These kinetic values are 
in quantitative agreement with those measured on the cell 



surface (Fig. IA). Electrostatic steering of fluorescein into the 
binding pocket (23, 24) is evidenced by the salt dependence of 
fcass- The association rate constant of 4 M5.3 is ^14-fold lower 
than that of 4-4-20 scFv in both PBS and LSB. This decrease 
somewhat offsets the 16,000-fold improvement in k iiss (3,000- 
fold in LSB), resulting in K A = 270 fM in PBS and K d = 48 fM 
in LSB. To our knowledge, the 4 M5.3/FL-bio interaction is 
the highest affinity noncovalent protein-ligand complex yet 
engineered. 

Sequences were obtained of each of the mutants represented 
in Fig. 1, and mutated residues are shown in Table 2. With the 
exception of 4 M4.15, 16, and 19 (which are identical isolates), 



Table 2. Mutations in selected scFvs 
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these gain-of-function mutations would complement loss-of- 
function mutational studies of strong protein-ligand interactions 
such as streptavidin-biotin (35). 

The majority of current antibody engineering strategies focus 
mutagenesis on the antibody CDR loops (e.g., refs. 8, 10, 36, and 
37), an approach that would not identify 4 of the 10 consensus 
mutations from this study. In fact, the first consensus mutation 
to become fixed (V H W103L) occurred at a highly conserved 
framework position removed from the binding site; the possi- 
bility that 1 or more subsequently selected mutations depended 
on the context of the V H 103 mutation must be considered. The 
experimentally simpler strategy of error-prone PCR of the whole 
scFv gene was therefore arguably more effective given these 
results. Concerns regarding potential immunogenicity of frame- 
work mutations in a therapeutic antibody might be addressed by 
judicious choice of buried residues from among the selected 
mutations. 

The relative ease with which extremely high affinity has been 
attained in this study might be attributed to (i) quantitatively 
optimized screening methodology and (ii) minimization of ex- 
pression bias by use of a eukaryotic expression host. In the first 
instance, labeling for optimal discrimination of improved clones 
(20) and reduced stochastic variation in dissociation kinetics 
because of assay of 10 4 -10 5 scFv molecules per yeast cell enable 
fine discrimination of affinity (reproducibility of ±10% in /c d iss 
assayed by flow cytometry for yeast-displayed scFvs and ±30% 
for K d ; ref. 38). In the second instance, yeast's secretory biosyn- 



thetic apparatus effectively folds and displays or secretes (22) the 
4-4-20 scFv, a molecule that forms inclusion bodies and is 98% 
insoluble when expressed in the periplasm of wild-type E. coli 
(39). Thus, a greater proportion of protein shape space will be 
sampled by yeast display because of elimination of prokaryotic 
expression biases against scFvs such as 4-4-20. 

Generation of antibodies that bind essentially irreversibly 
relative to the relevant physiological time scale could improve 
efficacy for cancer immunotherapy with noninternalizing tumor- 
associated antigens (4-7) and passive immunization against viral 
and microbial pathogens (40-43). The mutagenesis and screen- 
ing methodology described herein has also been applied suc- 
cessfully to an antibody against hen egg lysozyme (data not 
shown) and an antibody against the T cell receptor (44). Beyond 
the implications of the general capability to engineer femtomolar 
affinity antibodies, more specifically, a molecule such as 4 M5.3 
that binds and quenches fluorescein with an affinity similar to 
that of streptavidin-biotin may enable interesting approaches in 
analytical biochemistry when combined with the array of avail- 
able fluorescence-detection methodologies. 

The 4-4-20 scFv was obtained from D. M. Kranz, who also provided 
helpful comments on the manuscript. We thank G. Durack for assistance 
optimizing cell-sorting instrumentation. Funding was provided by the 
Whitaker Foundation and National Institutes of Health Grant 
GM55767. 
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Recombinant antibody fragments directed against cell sur- 
face antigens have facilitated the development of novel 
therapeutic agents. As a first step in the creation of cytotoxic 
immunoconjugates, we constructed a single-chain Fv frag- 
ment derived from the murine hybridoma OKT3, that 
recognizes an epitope on the e-subunit of the human CD3 
complex. Two amino acid residues were identified that are 
critical for the high level production of this scFv in 
Escherichia coll. First, the substitution of glutamic acid 
encoded by a PCR primer at position 6 of V H framework 
1 by glutamine led to a more than a 30-fold increase in the 
production of soluble scFv. Second, the substitution of 
cysteine by a serine in the middle of CDR-H3 additionally 
doubled the yield of soluble antibody fragment without 
any adverse effect on its affinity for the CD3 antigen. The 
double mutant scFv (Q,S) proved to be very stable in vitro: 
no loss of activity was observed after storage for 1 month 
at 4°C, while the activity of scFv containing a cysteine 
residue in CDR-H3 decreased by more than half. The 
results of production yield, affinity, stability measurements 
and analysis of three-dimensional models of the structure 
suggest that the sixth amino acid influences the correct 
folding of the V H domain, presumably by alfecting a folding 
intermediate, but has no effect on antigen binding. 
Keywords: affiniiy/anti-human CD3/bacterial expression/ 
single-chain Fv/solubility 



Introduction 

In recent years, the use of genetic engineering techniques has 
stimulated the development of antibody-like molecules for 
therapeutic and diagnostic uses (Winter and Milstein, 1991). 
Unlike glycosylated whole antibodies, fragments such as Fab 
and Fv can be easily produced in bacterial cells as functional 
antigen binding molecules (Better et at., 1988; Skerra and 
Pliickthun, 1988). To stabilize the association of the recombin- 
ant V H and V L domains, they have been linked in a single- 
chain Fv (scFv) construct with a short peptide that connects 
the carboxy terminus of one domain and Che amino terminus 
of the other (Bird et ai, 1988; Huston el ai, 1988). In 
comparison with the much larger Fab, F(ab) 2 and IgO forms 
of monoclonal antibody from which they are derived, scFvs 
have more rapid blood clearance and better tumor penetration 
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(Milenic et ai, 1991; Yokota et al., 1992; Adams et ai, 1993). 
ScFvs therefore represent potentially highly useful molecules 
for the targeted delivery of drugs, toxins or radionuclides to a 
tumor site. 

The efficient expression of active antibody fragments in 
bacteria is clearly of great technological importance. However, 
as with the expression of some other heterologous proteins in 
Escherichia coli, the yield of functional product for some 
antibody fragments can be very low. Sometimes. PCR primer- 
induced errors can lead to the expression of non-reactive 
antibody fragments (McCartney et ai, 1995). Poor expression 
may also arise from differences in the translation machinery 
and folding pathways of eukaryotic and bacterial cells. For 
example, some nucleotide sequences encoding antibody vari- 
able regions were expressed as functional proteins in eukaryotic 
host cells but were unable to express a produci in bacteria 
(Duenas et ai, 1995). Limiting factors for the efficient produc- 
tion of secreted antibody fragments in E.coli appear to be 
translocation to the periplasm (Ayala et ai, 1995) and folding 
in the periplasmic space (Knappik and Pliickthun, 1995). 

OKT3 is a murine monoclonal antibody (mAb) that recog- 
nizes an epitope on the E-subunit of the human CD3 complex 
(Kung et ai, 1979; Van Watiwe et ai, 1980; Transy et ai, 
1989). It has significant clinical utility. OKT3 has been widely 
used to suppress T cells and thereby prevent the rejection of 
transplants (Thistlethwaite et ai, 1984; Woodle et ai, 1991). 
Conversely. T cell activation and proliferation induced by 
OKT3 have been exploited to expand effector cells ex vivo 
for adoptive cancer immunotherapy (Yannelly et ai, 1990). 
As well as being used alone, the OKT3 mAb has been used 
as a component of bispecific antibodies to retarget cytotoxic 
T lymphocytes against tumor cells (Nitta et ai, 1990; Bohlen 
et ai, 1993) or virus infected cells (Sanna et ai, 1995). 
Recently, humanized versions of the OKT3 mAb have been 
expressed in COS cells (Woodle et ai, 1992; Adair etai, 1994). 

In this paper, we present the first example of the expression 
of an OKT3 derived scFv in E.coli. As part of the anti-CD3 
scFv construction process, the PCR amplified OKT3 V H gene 
was modified to improve its in vivo folding. Here we analyze 
the effect of two amino acid residues in the variable heavy 
chain domain on the yield, affinity and stability in vitro of 
anti-CD3 scFv. 

Materials and methods 

E.coli strains, plasmids and cell lines 

E.coliK\ 2 strain XLI-Blue (Stratagene, La Jolla, CA) was used 
as the cloning and expression host. For cloning, sequencing 
hybridoma-derived immunoglobulin variable regions and site- 
specific mutagenesis, pCR-Script SK(+) (Stratagene) was 
used. The scFv gene was assembled and expressed either in 
the plasmid pOPE51 (Kipriyanov et ai, 1994) or in pHOG21 
(Kipriyanov et ai, 1996b). The hybridoma OKT3 producing 
a monoclonal antibody (IgG2a) against the CD3 human T cell 
antigen has been described previously (Kung et ai, 1979; Van 
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Wauwe et at, 1980). The human CD3-posilive acute T cell 
leukemia cell line Jurkat and a CD3-nega(ive B cell line JOK-I 
were used for flow cytometry. 
Cloning of the variable regions 

Isolation of mRNA from freshly subcloned hybridoma OKT3 
cells and cDNA synthesis were performed as previously 
described (Dttbel et at, 1994). DNA coding for the light chain 
variable domain was amplified by PCR using the primers Bi5 
and BiS that hybridize to the amino terminal portion of the ic 
chain constant domain and the framework I (FRI) region of 
the K chain variable domain (DUbel et at, 1994). For the 
amplification of DNA coding for the heavy chain variable 
domain, the primer Bi4 that hybridizes to the amino terminal 
portion of the y chain constant 1 domain (DUbel el at, 1994) 
and Bi3f that hybridizes to the FRI region of the heavy chain 
(Goiter et at, 1995; Kipriyanov et at, 1996b) were used. The 
50 nl reaction mixture contained 10 pmol of each primer and 
50 ng of hybridoma cDNA, 100 pM each of dNTP, 1 X Vent- 
buffer (Boeringer Mannheim. Mannheim, Germany), 5 ug 
BSA and 1 U Vent DNA polymerase. 30 cycles of I min at 
95°C, 1 min at 55°C and 2 min at 75°C were carried out in a 
thermocycler. The amplified DNA was purified with a QIA- 
quick PCR Purification Kit (Qiagen, Hilden, Germany) and 
blunt end ligated into an Srfl digested pCR-Script SK(+) 
(Stratagene) for dideoxy sequencing (Sanger et at, 1977) and 
site-specific mutagenesis. 
Construction of plasmids encoding scFv 
The linker used in this study was a 17 amino acid tag-linker 
that includes a tubulin epitope recognized by mAb YOLI/34 
(Breitling et at, 1991). DNA coding for the variable domains 
of OKT3 was inserted into pOPE51 (Kipriyanov et at, 1994) 
in two cloning steps using Ncol/Hindlll for the heavy-chain 
DNA and EcoRV/BamHl for the light-chain DNA. The whole 
scFv gene was recloned in pHOG21 (Kipriyanov era/., 1996b) 
as a NcoVBamHl DNA fragment. 

Construction of anti-CD i mutants 

Mutations were generated in the V H domain derived from 
OKT3 by site-specific mutagenesis according to Kunkel et at 
(1987). The amino acid substitution of Cys at position HI00A 
by Ser and of Glu at position H6 by Gin was achieved using 
either primer SKI 5 '-GTAGTCAAGGCTGTAATGATCATC 
or SK2 5'-GCCCCAGACTGCTGCAGCTGCAC or both. 
Exoli expression and purification of scFv fragments 
XLl-Blue E.coli cells (Stratagene) transformed with the scFv 
expression plasmid pHOG21 were grown overnight in 2XYT 
medium with 50 p.g/ml ampicillin and 100 mM glucose 
(2XYT GA ) al 3 7 ° c - Dilutions (1:50) of the overnight cultures 
in 2X YTqa were grown as flask cultures at 37°C with shaking 
at 200 r.p.m. When cultures reached ODaxj = 0.8, bacteria 
were pelleted by centrifugation at 1500 s f° r '0 m i n and 20°C 
and resuspended in the same volume of fresh 2XYT medium 
containing 50 pg/ml ampicillin and 0.4 M sucrose. IPTG was 
added to a final concentration of 0.1 mM and growth was 
continued at room temperature (20-22°C) for 20 h. The cells 
were harvested by centrifugation at 5000 g for 10 min and 
4°C. The culture supernatant was retained and kept on ice. To 
isolate soluble periplasmic proteins, the pelleted bacteria were 
resuspended in 5% of the initial volume of ice-cold 50 mM 
Tris-HCI, 20% sucrose, I mM EDTA, pH 8.0. After a I h 
incubation on ice with occasional stirring, the spheroplasts 
were centrifuged at 30 000 g for 30 min and 4°C leaving the 



soluble periplasmic extract as the supernatant and spheroplasts 
plus the insoluble periplasmic material as the pellet. The 
culture supernatant and the soluble periplasmic extract were 
combined, clarified by additional centrifugation (30 000 g, 
4°C, 40 min) and passed first through a glass filter of pore 
size 10-16 pm and then through a Membrex TF filter of pore 
size 0.2 pm (MembraPure, Lorzweiler, Germany). The volume 
was reduced 10-fold by concentration with Amicon YM 10 
membranes (Amicon, Witten, Germany). The concentrated 
supernatant was clarified by centrifugation and thoroughly 
dialyzcd against 50 mM Tris-HCI, 1 M NaCI, pH 7.0 at 
4"C. Immobilized metal affinity chromatography (IMAC) was 
performed al 4°C using a 5 ml column of Chelating Sepharose 
(Pharmacia) charged with Ni 2+ and equilibrated with 50 mM 
Tris-HCI, I M NaCI, pH 7.0 (start buffer). The sample was 
loaded by passing the sample over the column. It was then 
washed with 20 column volumes of start buffer followed by 
start buffer containing 50 mM imidazole until the absorbance 
(280 nm) of the effluent was minimal (about 30 column 
volumes). Absorbed material was eluted with 50 mM Tris- 
HCI, I M NaCI, 250 mM imidazole, pH 7.0. After buffer 
exchange to 50 mM MES, pH 6.0, the protein was further 
purified on a Mono S ion-exchange column (Pharmacia). The 
purified scFv was dialyzed into PBS ( 1 5 mM sodium phosphate, 
0.15 M NaCI, pH 7.4). For long-term storage, scFv were 
frozen in presence of BSA (final concentration 10 mg/ml) and 
kept at -80°C, as recommended (Kipriyanov et at, 1995). 

Isolation of scFv from inclusion bodies of bacteria trans- 
formed with plasmid pOPE5l was performed essentially as 
described previously (Kipriyanov et al., 1996a). 
SDS-PACE and Western blot analysis 
SDS-PAGE was carried out according to Laemmli (1970) 
under reducing conditions. Immunoblol analysis using anti 
c-myc mouse mAb 9EI0 (Cambridge Research Biochemicals, 
Cambridge, UK) was performed as described previously 
(Kipriyanov el at, 1994). 

Analyses of scFv stability 

For stability analyses, scFv preparations were stored at 4°C at 
a concentration 50 pg/ml in PBS for 1 month. The activities 
of samples after storage were determined by flow cytometry. 
Flow cytometry 

We incubated 5X10 5 CD3 + Jurkat or CD3" JOK-1 cells in 
50 ul RPM1 1640 medium (Gibco BRL, Eggenstein, Germany) 
supplemented with 10% fetal calf serum (FCS) and 0.1% 
sodium azide (referred to as complete medium) with 100 pi 
of a sample containing scFv for 45 min on ice. After washing 
with complete medium, the cells were incubated with 100 pi 
of 10 ug/ml anti c-myc mAb 9E10 (1CI Biochemicals) in the 
same buffer for 45 min on ice. After a second washing cycle, 
the cells were incubated with 100 pi of FITC-labeled goat 
anti-mouse IgG (Gibco BRL) under the same conditions as 
before. The cells were then washed again and resuspended in 
100 pi of a I pg/ml solution of propidium iodide (Sigma, 
Deisenhofen, Germany) in complete medium to exclude dead 
cells. The relative fluorescence of stained cells was measured 
using a FACScan flow cytometer (Becton Dickinson, Mountain 
View, CA). 

Measurement of binding affinity 

Affinities were derived either from the FACScan analysis of 
direct binding of scFv to Jurkat cells as described by Chamow 
et al. (1994) or from a competitive inhibition assay. In the 
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latter case, increasing concentrations of scFv were added to 
a subsaturating concentration of FITC-labeled mAb OKT3 
(7.4 nM) and were incubated with Jurkat cells as described 
above. Fluorescence intensities of stained cells were measured 
as described above. Binding affinities were calculated accord- 
ing to the following equation derived from that of Schodin 
and Kranz (1993): 

K m = (1 + IF1TC-OKT3] x ^oktj.VICjo 

where I is the unlabeled inhibitor (scFv). (FITC-OKT3) is the 
concentration of FITC-labeled mAb 0KT3, /faoicn) is the 
binding affinity of mAb OKT3 (I.2X10 9 M"'; Adair et at., 
1994) and IC50 is the concentration of inhibitor that yields 
50% inhibition of binding. 

Determination of the yield of soluble antibody fragments 
The expression levels of soluble scFv fragments were deter- 
mined in cleared culture medium and in crude periplasmic 
extracts isolated from shake-tube mini-cultures (5 ml). Culture 
supernatants were concentrated 20-fold using an Ullrafree- 15 
Biomax-10 centrifugal filter device (Millipore, Bedford, MA, 
USA) and dialyzed into PBS. The periplasmic extracts from 
cell pellets were prepared as described previously (Kipriyanov 
et at., 1996b). For each scFv variant, three independent 
expression cultures were used. The concentrations of functional 
recombinant antibody fragments were determined from the 
fluorostaining of jurkat cells using samples of periplasmic 
preparations and concentrated culture medium by the interpola- 
tion of their mean fluorescence intensities on the standard 
curves obtained with purified scFv of known concentration. 
At least four dilutions of samples were used for calculations. 

Molecular modeling 

Modeling was performed using AbM (Oxford Molecular, 
Oxford. UK). The framework was built by homology using 
HyHEL-5 (Sheriff et ai, 1987) for the parent light chain and 36- 
71 (Strong etal.. 199l)forthe heavy chain. The complementarity 
determining regions (CDR) LI, L2, L3, HI and H2 were built 
using canonical classes as proposed by Chothia et al. (1989) 
(CDR-LI = Class I. CDR-L2 = Class I, CDR-L3 = Class I. 
CDR-HI - Class 1, CDR-H2 = Class 2) while CDR-H3 was 
built using the CAMAL algorithm (Martin etal., 1989). 

AbM sometimes has problems with junction regions where 
loops are spliced on to the framework. This can result in 
trigonal planar or D-amino acids at these junction sites. This 
occurred for residue HI02 and this residue was rebuilt manually 
as an L-amino acid. 

Other methods 

Protein concentrations were determined by the Bradford dye- 
binding assay (Bradford, 1976) using the Bio-Rad protein 
assay kit (Bio-Rad Laboratories, Munich, Germany). The 
concentrations of purified scFv were calculated from the A^n 
values using the extinction coefficient e 1 ro!/ml = 1.84 derived 
from the Trp, Tyr and Phe content of the molecule using 
DNAid+1.8 Sequence Editor for Macintosh (F.Dardel and 
P.Bensoussan, Laboraioire de Biochimie, Ecole Polytechnique, 
Palaiseau, France). Analytical gel filtration of the scFv prepara- 
tion was performed in PBS using a Superdcx 75 HRIO/30 
column (Pharmacia). The sample volume and flow rate were 
200 u.1 and 0.5 ml/min, respectively. For calibration of the 
column, a Low Molecular Weight Gel Filtration Calibration 
Kit (Pharmacia) was used. 
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Fig. I. Critical features of ihe OKT3 amigen binding site. The molecular 
model of OKT3 Fv is shown as a Co itace ivilh side chains of amino acid 
residues HIOOA and H6. 

Results 

Modification of PCR amplified OKT3 V H gene 
The V region genes of the murine monoclonal antibody OKT3 
(Van Wauwe etal. 1980) were amplified by PCR from a cDNA 
preparation using two pairs of highly degenerate primers. Ten 
clones of each amplified V H and V L gene were sequenced and 
found to be identical. All the analyzed heavy chain variable 
regions contained QVQLQE as the N-terminal sequence, This 
region was encoded by the 5' primer Bi3f that contained a 
degenerate codon corresponding either to glutamic acid or 
glutamine residue at position 6 (Gotter et ai. 1995; Kipriyanov 
et al., 1996b). The OKT3 scFv gene was assembled in Ihe 
plasmid pOPE5l (Kipriyanov et al., 1994) and expressed in 
E.coli. The resulting recombinant scFv product contained an 
unpaired cysteine residue near to its C-terminus that was 
specially introduced for making bivalent antibodies by 
chemical conjugation or for site-specific biotinylalion 
(Kipriyanov et al.. 1994). 

FACScan analysis demonstrated no binding of scFv-OKT3 
isolated from periplasmic inclusion bodies (Kipriyanov et al., 
1994; Kipriyanov el al., 1995) id CD3-positive Jurkat cells 
(data not shown). A detailed analysis of the predicted structure 
based on the OKT3 V domain sequences allowed us to identify 
two amino acid residues in the V H domain that might be 
critical for Ihe activity of this recombinant antibody (Figure 
I). First, a comparison with the OKT3 cDNA sequence (Adair 
et al., 1994) showed that position 6 of FRI was occupied by 
glutamine but not by glutamic acid as in the PCR amplification 
product. Furthermore, the consensus sequences of the Kabat 
database demonstrated that the cloned V H gene fragment 
belongs to mouse immunoglobulin subgroup lib (Kabat et al.. 
1991), in which 92% of the members have Q in position 6. 
Second, the OKT3 V H domain was found to contain a cysteine 
residue in the CDR-H3 which could interfere with folding by 
disrupting normal disulfide bonding, or might be oxidized 
during IMAC on an Ni column under denaturing conditions 
(Kipriyanov etal., 1994). Therefore, we performed site-specific 
mutagenesis of the V H gene to substitute E6 by Q and CIO0A 
(numbering scheme of Kabat etal. (1991)) by S. This double- 
mutant scFv-dmOKT3 (Q,S) demonstrated strong binding to 
CD3-positive Jurkat cells and no interaction with CD3-negative 
JOK-1 cells when purified from inclusion bodies (data not 
shown). 
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Construction and expression of anti-CD3 seFv mutants 
To clarify how the amino acid changes described above 
contribute to the activity of the anti-CD3 scFv, we investigated 
four different scFv variants: a variant containing E6 and 
C100A that was amplified from hybridoma cDNA by PCR 
(E,C), a variant corresponding to the cDNA sequence published 
for OKT3 (Q.C; Adair etal., 1994) and two variants containing 
Ser instead of Cys at V H position I0OA (E.S and Q.S). 

To avoid working with inclusion bodies, which have to be 
refolded and to prevent vector-derived C-terminal unpaired 
cysteines from affecting the scFv properties (e.g. possible 
formation of an additional intramolecular disulfide bond with 
Cys-IOOA or scFv dimerization), we chose the plasmid 
pHOG2l for expressing the mutated scFv genes (Figure 2A). 
The bacterial pHOG21 expression vector was designed for 
the high-level production of soluble recombinant antibody 
fragments in E.coii (Kipriyanov et al., 1996b). The antibody 
V H fragment is preceded by a pelB leader sequence for 
secretion of recombinant antibody into the periplasmic space. 
The C-terminus of the V H domain and N-terminus of the V L 
domain are joined by a flexible 17 amino acid tag-linker that 
includes a tubulin epitope recognized by mAb YOL1/34 
(Breitling et al., 1991). A short peptide tag containing an 
epitope of the proto-oncogenc c-myc recognized by mAb 9EI0 
(Evan et al., 1985) is located at the C-terminus of the V L 
domain followed by six histidine residues to facilitate the 
isolation of recombinant antibody fragments by IMAC. The 
sequence of the OKT3 derived scFv assembled in the plasmid 
pHOG21 is shown in Figure 2B with the mutations at amino 
acid positions 6 and I00A of the heavy chain indicated. 

Recently, we showed that the addition of 0.4 M sucrose to 
the growth medium gives a 15-25-fold increase in the yield 
of soluble scFv for bacterial shake-tube cultures and an 80- 
150-fold increase for shake-flask cultures (Kipriyanov et al., 
1997). We also found that the scFv could be made to accumulate 
in the periplasm or be secreted into the medium by simply 
changing the incubation conditions and the concentration of 
the inducer. Therefore, to obtain higher yields of soluble anti- 
CD3 antibody fragments, we incubated induced E.coii cells in 
the presence of 0.4 M sucrose. Western blot analysis of cell 
pellets and periplasmic extracts of bacterial cultures expressing 
the four variants of OKT3 derived scFv demonstrated substan- 
tial differences in the ratio of soluble and total scFv (Figure 
3). While the total amount of recombinant product found in 
the cell pellet seemed to be equal for all scFv variants, much 
less soluble scFv was found for variants containing Glu at 
position 6 (Figure 3, lanes 2 and 6). FACScan analysis 
demonstrated the specific binding of periplasmic extracts for 
all the anti-CD3 scFv variants to CD3-positive Jurkat cells, 
although the fluorescence intensity obtained for scFvs with E6 
was significantly lower (Figure 4A), 

Purification of anti-CD3 scFv variants 
To clarify whether the difference in antigen binding activity 
of periplasmic extracts containing different scFv variants 
(Figure 4A) is due to the difference in affinity or merely 
reflects the production levels of soluble antibody fragment, we 
performed a large-scale isolation of scFv using shake-flask 
bacterial cultures in the presence of 0.4 M sucrose. Under 
these conditions, we previously found that most of the secreted 
scFv was released into the medium (Kipriyanov el al.. 1997). 
The supernatant and periplasmic content of the induced bac- 
terial culture was concentrated and passed through an Ni 2+ 
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Fig. 2. The structure of ScFv and expression vector. fA) Schematic 
representation of the plasmid pHOG21. Ap", ampicillin resistance-encoding 
gene; c-myc. a sequence encoding an epitope recognized by the monoclonal 
antibody 9EI0: ColEI, origin of DNA replication; ///C, intergenic region of 
phage f 1 ; His 6 , a sequence encoding six C-lerminal histidine residues; 
linker, a sequence encoding 17 amino acids connecting the V„ and V L 
domains; pe/B, signal peptide sequence of bacterial pectate lyase; P/O. wt 
lac promoter/operator. (B) The nucleotide and deduced amino acid 
sequences of the scFv derived from hybridoma OKT3, The amino acid 
sequences corresponding lo the complementarity determining regions (CDR> 
are shown shaded. The nucleotide sequences corresponding to PCR primers 
are underlined. The sequences coding for the VOL epitope in linker region 
as well as c-myc epitope and six hisudines in the carboxy terminal part of 
the scFv are indicated. 

charged Chelating Sepharose column. After washing the 
column with buffer containing 50 mM imidazole, the bound 
scFv was eluted with 250 mM imidazole as a single peak in 
2.5 column volumes. This purification procedure allowed us 
to isolate scFv in one step with a purity of about 95% (Figure 
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Fig. 3. Western bloi analysis of cell pellets and periplasms exuacls from 
Exoli clones expressing different anli-CD3 scFv variants. Lanes: I, 3. 5. 7. 
total cell lysale from induced bacteria corresponding to 100 ul of culture: 2. 
4. 6. 8. periplasmic extracts corresponding to 180 pi of culture. The scFv 
were detected using mAb 9610 recognizing the C-tcrminal c-myc epitope. 
As a control, 1 ug of pure scFvdmOKT3 isolated from inclusion bodies 
was used. The positions of molecular mass markers are shown on the left. 



5A). The main contaminant present in samples of scFv purified 
by IMAC has recently been identified as an E.coli metal- 
binding 27 kDa WHP protein (WUlfing et ai, 1994). An 
analysis of its amino acid composition showed that the WHP 
protein has an isoelectric point (fa) of 5.16: anti-CD3 scFv 
variants were found to be more basic (the calculated fa was 
between 7.27 for the E,C and 7.52 for the Q,S variant). This 
charge difference allowed us to purify the recombinant antibody 
fragments to homogeneity by ion-exchange chromatography 
on a Mono S column (Figure 5B). Analytical gel-filtration on 
a Superdex 75 column demonstrated that all the isolated scFv 
preparations consisted only of monomers (data not shown). 
Affinity and stability measurements 

Our attempts to use radioiodinated scFv preparations for 
measuring the direct binding of recombinant antibodies to 
CD3-positive Jurkat cells were unsuccessful. Unfortunately, 
iodination using chloramine-T yielded an inactive product for 
both anti-CD3 scFv and Fab fragment prepared from mAb 
OKT3 (data not shown). It is possible that iodination blocked 
tyrosine residues in the CDR regions that may be important 
for antigen-binding (Figure 2B). We therefore employed two 
different non-radioactive approaches based on flow cytometry 
(Bohn, 1 980) that do not require any modification of the protein. 

In the first approach, recombinant antibody fragments were 
incubated with cells as in a standard radioprotein binding 
assay, except that an anti-c-myc mAb and fluorescent anti- 
mouse IgG reagent were used to detect the amount of bound 
scFv. In comparison with a standard radioligand binding assay, 
the same variables (except the number of molecules bound at 
saturation) can be measured and an affinity constant determined 
from the slope of the resultant Scatchard curve (Chamow 
el ai., 1994). 

The binding of scFv preparations was measured using human 
Jurkat cells as a source of naturally expressed cell bound 
CD3e. Binding to CD3-negative JOK-I cells was used as a 
negative control. The results of fluorostaining of Jurkat cells 
displayed in Figure 4B demonstrate that the same concentra- 
tions of different scFv variants yield similar fluorescence 
(slightly higher values were obtained for variants containing 
Gin at position 6). A pattern of increased fluorescence with 
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Fig. 4. Flow cytometric analysis of the binding of anti-CD3 scFvs to Jurkat 
cells. (A) Analysis of binding of periplasmic extracts. IB) Analysis of 
binding of pure scFv preparations at concentration 25 ug/ml. The presence 
of two peaks of fluorescence indicates that not all cells or Ibe used line 
express CD3 antigen. As a negaUve conuol. binding to CD3-negative 
JOK-I was used. 

increased amounts of scFv was observed that seems to reach 
a plateau at higher concentrations (Figure 6A). On the basis 
of fluorescence measurements at different concentrations of 
added scFv, typical Scatchard curves were generated from 
which K a values were derived (data not shown). 

In a second approach, the binding efficiency of anti-CD3 
scFv variants to Jurkat T cells was investigated by competition 
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Fig. 5. 12% SDS-PAGE analysis of scFv preparalioas. (A) Analy.sU of scFv-dmOKT3 (Q.S) ai different steps of purificalion. Lanes: M. molecular mass 
markers (values in kDa are shown on the left): I. iodI cell lysaie; 2. soluble periplastic content: .1. concentrated culture medium: 4. scFv isolated by IMAC; 
5, scFv purified by ion-enchange chromatograpliy. (B) Analysis of purified scFv preparations for variant E.C (lane 1 1. Q.C (lane 2). E,S (lane 3) and Q.S 
(lane 4). The gels were stained with Coomass'ie Brilliant Blue. 
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Fig. 6. Analyses of apparent affinities of scFv variants by flow cytometry. IAJ Direct binding of different scFv variants to Jurkat cells. The cells were 
incubated with various concentrations of OKT3 derived scFv variants: E.C ( filled squares). Q.C (open circles). E.S (open squares) and Q.S (filled circles). The 
curves were constructed by plotting the mean fluorescence intensity against the scFv concentration. (B) Inhibition of binding of FITC-OKT3 to Jurkat cells in 
presence of mAb OKT3 (filled triangles) and scFv variants: Q.C (open circles). E.S (open squares) and Q.S (filled circles). 



with FITC-labeled mAb OKT3. The data presented in Figure 
6B demonstrate that the OKT3 derived scFv Q,C, E,S and 
Q.S variants compeled similarly, all at -100 times the concen- 
tration of the intact IgG OKT3. 

Analysis of the stability of anti-CD3 scFv variants after 
storage in PBS for 1 month at 4°C demonstrated a substantial 
loss of antigen-binding activity for scFv containing Cys in 
CDR-H3 (Figure 7). 

Table I summarizes the results of the affinity and stability 
measurements. The apparent affinity values obtained for all 
the scFv variants proved to be quite close, indicating (i) only 
a slight effect of the sixth amino acid on the antigen binding 
and (ii) that the replacement of Ser for Cys in the middle of 
CDR-H3 does not disturb the antigen-antibody complex. Both 
the glutamic acid at position H6 and especially the cysteine 
at position of H100A led to a decreased stability of the scFv, 



probably because of a higher tendency for such antibody 
fragments to aggregate and/or for oxidation of unpaired cyst- 
eine residues during storage. No proteolytic degradation during 
storage was detected for any of the examined scFv variants 
(data not shown). 

Analysis of expression yields 

To study the influence of positions H6 and H100A on the 
production levels of soluble scFv fragments, we analyzed the 
antigen binding activities of periplastic extracts and the 
concentrated culture medium of bacteria expressing scFv E,C, 
Q,C, E,S and Q,S variants. The expression yield data presented 
in Table I demonstrate that a single amino acid substitution of 
E6 by Q yields more than a 30-fold increase in soluble scFv 
product. In contrast, the single exchange of C100A by S led 
to a more moderate twofold increase in soluble scFv. These 
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effects were cumulative: the total yield of the Q.S variant was 
66-fold higher than' that for the E.C scFv variant (Table I). 
For all the examined variants, a small proportion of the 
functional soluble scFv was found to be released into the 
culture medium. 

Discussion 

Recombinant antibody fragments directed against cell surface 
antigens can provide useful components for the development 
of therapeutic agents. To target cytotoxic effector T cells to a 
tumor site, we have constructed an anti-human CD3 single- 
chain antibody by PCR amplification of the immunoglobulin 
variable domain genes from cDNA of the hybridoma OKT3. 
Expression of the assembled scFv gene in E.coli yielded a 
non-functional product after refolding from inclusion bodies. 

In general, the primers we and other workers use for 
amplifying V genes from hybridoma cDNAs are designed to 
match all the known sequences of immunoglobulin genes. 
However. PCR amplification using degenerate primers does 
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Fig. 7. Flow cytometric analyses of infraction of 0KT3 derived scFv 
variants with Jurltat cells. (A) Fluorescence intensity obtained for fresh scFv 
preparations at concentration 25 ug/ml. (B) Fluorescence intensity obtained 
for the same scFv preparations after storage in PBS for I month at 4*C. As 
controls, the interaction of culture medium. mAb OKT3 and irrelevant mAb 
HD20 with Jurkat cells is shown. 



not always yield a gene with naturally occurring codons in the 
primer region (McCartney et at., 1995). It is therefore often 
not possible to know which codons occur naturally if, as in 
our case, the DNA sequence was not then available. For 
example, the same set of primers resulted either in Glu or Gin 
in H6 after amplification of the Vh gene of an antibody against 
anti-human CD19 (Kipriyanov et ai, 1996b). Regarding the 
significance of this position, there was no indication in the 
literature that it may be critical for bacterially expressed 
antibody fragments. 

To improve the properties of the recombinant antibody 
fragment, we focused on the amino acid residues which are 
structurally uncommon for the V H subgroup lib: glutamic acid 
at the position H6 of FR1 and a cysteine in the middle of the 
CDR-H3 loop. Site-specific mutagenesis and a change of 
expression system (soluble secreted scFv versus inclusion 
bodies) allowed us to clarify their influence on the production 
of a functional scFv antibody fragment. 

We demonstrated that a single amino acid substitution of E 
by Q at position 6 of the heavy chain resulted in a 30-fold 
increase in soluble scFv product and significantly increased 
the stability of the recombinant molecule during storage. 
However, this substitution had very little effect on the affinity 
(scFv containing Q had affinity constants about 1.5 times 
higher than variants with E). This slight difference may be 
explained by the possible difference in the percentage of 
functional scFv (Kipriyanov el at., 1994). We can therefore 
conclude that the sixth amino acid influences the correct 
folding of the V H domain, perhaps by affecting some folding 
intermediate, but it has little or no effect on antigen binding. 
This conclusion was supported by computational molecular 
modeling. Examination of the residues which surround position 
6 of the heavy chain in the three-dimensional model reveals 
no reason why a Glu or Gin residue should have any significant 
effect on the conformations of the CDRs (Figure I). 

It is not clear how Glu may effect the folding of the scFv 
fragment in the bacterial environment because very little is 
known about this process. Attempts have been made to prevent 
the side reaction of aggregation by overexpressing some 
known enzymes of the E.coli folding machinery such as 
the GroES/L chaperones, disulfide-isomerase and proline-o'i-- 
frani--isomerase (Knappik el aJ., 1993; Duenas el al., 1994). 
However, these proteins did not increase the yield of soluble 
antibody fragments. The presence of a periplasmic chaperone 
has therefore been postulated but not yet identified (Wulfing 
and Pliickthun. 1994). From a variety of experiments, evidence 
is accumulating that the primary sequence of the antibody 



Table 1. Expression levels of anti-CD3 scFv variants, their stabilities and affinities: lo human CD3 antigen 


scFv Yield of scFv 
variant (u.e/1 of culture)* 


Release 

of scFv (%)'' 


AC." 

fM-'/IO') 


iM-'/lO')' 


Stability 
l*f 


E.C 72.7 * 19.5' 
Q.C 2314.7 i: 578.8 
E.S 148.4 ± 37.7 
Q.S 4846.0 £ 477.3 


9.6 2 1.4' 
24.4 ± 1.7 
25.4 ± 7.9 
29.4 ± 1.9 


109 
1.96 
1.27 
1.42 


n.d.« 
3.16 
2.49 
2.95 


37.87 
46.63 
70.10 
100 



/Total amount of soluble scFv both in crude periplasmic extract and culture medium estimated by Row cytometry. 

Percentage of loial scFv amount found in culture medium. 

■■"Binding constants as determined by cylofluorometric Scaichard analysis. 

''Binding constants as determined from a competitive inhibition assay using FITC-OKT3. 

'■'Activity (%) alter i month at4*C as determined by fiow cytometry. 

'Arithmetic mean and standard deviation based on three independent experiments. 

-*Not determined. 
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plays a decisive role in the efficiency of folding in a bacterial 
environment (Carter et ai, 1992; Knappik and Pliickihun, 
1995). Our own results lead to a similar conclusion. The amino 
acid H6 influences not only the folding efficiency but also the 
stability of the correctly folded scFv. 

The single exchange of Cys at position HIOOA by Ser also 
led to a twofold increase in soluble scFv, Three residues before 
the start of CDR-H3 is a conserved cysteine at position H92 
which forms a structural disulfide bond with position H22. 
Thus, having another Cys nearby (at HIOOA) could easily 
allow mis-folding where HIOOA instead of H92 is involved 
in forming the disulfide bond with H22, thereby generating a 
mis-folded, insoluble and non-functional product. Analogously, 
Ostermeier et al. (1995) demonstrated that substitution of an 
uncommon cysteine at position H50 (first amino acid of CDR- 
H2) by a serine led to a 20-fold increase in soluble Fv 
production. Since the authors of this work were working with 
an Fv fragment, a mutation in the V H domain can only 
influence the yield of the heavy chain fragment. Although 
direct comparisons of the influence of Cys residues in different 
CDRs on correct folding cannot be made, these results suggest 
that such uncommon residues may be more critical for the 
folding of a single antibody domain (V H ) than for scFv. 

In our case, a cysteine was substituted that is present directly 
in the middle of CDR-H3. which is in the middle of the 
antigen-combining site and generally has the greatest influence 
on binding affinity. CDR-H3 plays a prominent role not only 
in ligand binding, but also in the contact with the V L domain 
and with the other CDRs (Padlan, 1994). Although cysteine 
can occasionally form hydrogen bonds, this is rare in proteins 
(Baker and Hubbard, 1984) and it is a relatively hydrophobic 
residue. We therefore considered two possible mutations at 
HIOOA: serine (maintaining the size as closely as possible, 
but introducing a very hydrophilic residue) and valine (increas- 
ing the hydrophobic nature, but adding an extra atom). Given 
that the residue is exposed to solvent in the model, we 
chose to make the mutation to serine since any increase in 
hydrophobicity could lead to a change in folding of the loop. 
We were aware, however, that the substitution might interfere 
with antigen binding or influence the contact between the 
variable domains. Fortunately, the Cys to Ser mutation had no 
effect on antigen binding and, as hoped, led to a significant 
improvement in the stability of the scFv. Although the exposure 
patterns of the various amino acid types in immunoglobulins 
are comparable to those in other water-soluble proteins, 
cysteines are more exposed in CDRs than they are in the 
framework regions (Padlan, 1994). This is especially true for 
short (10 residues or less) hypervariable loops which do not 
have much opportunity to bury one of their residues while 
maintaining a distorted hairpin conformation for antigen bind- 
ing. Exposure of the cysteine 100A SH group to solvent may 
result in oxidation or modification of the group over time and 
this may have an influence on the stability of the antigen- 
antibody complex. It is also possible that the unpaired cysteines 
of two adjacent scFv molecules could form a disulfide bond, 
thus giving rise to inactive and probably insoluble scFv dimers 
and causing a decrease in the concentration of functional scFv. 
These factors would explain the experimentally observed 
instability during the storage of scFv variants containing Cys 
in CDR-H3. 

It is worth noting that in the present work we actually 
compared two different strategies of folding, i.e. in vivo and 
in vitro. The renaturation procedure, which has been used to 



refold several antibody fragments (Kipriyanov et ai, 1994; 
Goiter et al., 1995) and a more complex scFv::streptavidin 
fusion protein (Kipriyanov et ai. 1996a) did not lead to the 
formation of an active scFv-E,C variant. These results point 
to limitations in the folding strategy in vitro compared with 
in vivo and indicate how such problems can be overcome. 

In conclusion, we have constructed a modified version of 
an anti-human CD3 scFv antibody fragment with improved 
stability in vitro and increased production level in bacteria. 
This molecule may be particularly useful for the creation of 
recombinant cytotoxic immunoconjugates. 
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Structural Effects of Framework Mutations on a Humanized 
Anti-Lysozyme Antibody 1 

Margaret A. Holmes,* Timothy N. Buss,* and Jefferson Foote 2 ** 

A humanized version of the mouse anti-lysozyme Ab D1.3 was previously constructed as an Fv fragment and its structure was 
crystallographically determined in the free form and in complex with lysozyme. Here we report five new crystal structures or 
single-amino acid substitution mutants of the humanized Fv fragment, four of which were determined as Fv-lysozyme complexes. 
The crystals were isomorphous with the parent forms, and were refined to free X values of 28-31% at resolutions or 2.7-2.9 A. 
Residue 27 in other Abs has been implicated in stabilizing the conformation or the first complementarity-determining region 
(CDR) or the H chain, residues 31-35. We find that a Phe-to-Ser mutation at 27 alters the conformation or immediately adjacent 
residues, but this change is only weakly transmitted to Ag binding residues in the nearby CDR. Residue 71 or the H chain has been 
proposed to control the relative disposition or H chain CDRs 1 and 2, based on the bulk or its side chain. However, in structures 
we determined with Val, Ala, or Arg substituted in place of Lys at position 71, no significant change in the conformation of CDRs 
1 and 2 was observed. The Journal of Immunology, 2001, 167: 296-301. 



Humanized Abs arc created by replacing Ihe complcmen- 
larily-determining regions (CDRs) 3 of a human Ab (as 
defined by Wu and Kabai; Refs. I, 2) with Ihe corre- 
sponding CDRs of a nonhuman Ab (3). This CDR graft transfers 
the antigenic specificity of the CDR donor molecule, but leaves Ihe 
new engineered molecule immunologically human, inasmuch as 
the immunogenicity of humanized Abs in humans is extremely low 
(4. 5). The first humanized Ab was specific for the hapten nitro- 
phenacetyl. This molecule had been CDR grafted in the H chain 
only, which was coexpressed with a mouse L chain. The human- 
ized anti-nitrophenaceiyl showed 1.5- to 3-fold reduced hapten af- 
finity relative to a control molecule with murine sequences in both 
chains (3). This finding of altered afiinily proved that framework 
residues can influence the structure of (he Ag combining site. Riech- 
mann e( al, (4) confirmed this finding in a humanized anti-CD52. 
The initial humanized construct showed weak avidity. A single 
Ser-to-Phe mutation al framework residue H27 J restored avidity to 
near thai of the fully murine control. The importance of framework 
residues in maintaining the structure of the CDRs and the frequent 
need for mulaiional revisions in the framework have since been 
confirmed many more limes during the engineering of humanized 
Abs to have avidity matching that of their murine antecedents (6). 
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We developed a humanized anti-lysozyme (HuLys) as a model 
system for studying structural issues attending the transfer of 
CDRs from a murine to a human framework (7-9), Thus, murine 
and human segments for ihe construction were chosen from among 
Ab V domains whose structures had been determined. The six 
CDRs of HuLys come from the murine Ab DI.3, which was raised 
against hen egg lysozyme (10, 1 1). The structure of Ihe DI.3 het- 
erodimer of H and L chain V regions (Fv) has been determined al 
1.8-A resolution in both ihe liganded and unliganded forms (12, 
13). The HuLys H chain framework (residues HI-H30, H36-H49, 
H66-H94, and H103-HI 13 in the Kabat numbering system) 
comes from the human myeloma protein NEW, whose structure 
has been determined ai 2.0 A (14). The k L chain framework 
(residues LI-L23, L35-L49, L57-L88, and L98-LI08) is a con- 
sensus sequence similar lo that of the human Bence-Jones prolein 
REI, also determined al 2.0 A (15). 

The crystal structures of the HuLys Fv in free form (16) and 
completed with the Ag lysozyme (17) were previously determined 
al 2.9 and 2.7 A, respectively IBrookhaven Protein Data Bank 
accession numbers IBVL and IBVK). In this work, we describe 
crystal structures of a series of single substitution mutants of the 
HuLys Fv, viz H27S, H71V, H7IA, and H7IR. 

Materials and Methods 

Prolein engineering 

Fvs were expressed in Escherichia coli using die pAK 19 vector (18), which 
uses a phoA promoter and heat-stable enteroloxin II leader sequence. This 
vector directs gene products to the periplasm, from which correctly folded, 
disulfide-oxidized molecules are released after cell harvest. Material used 
in the present work was released from the periplasm by osmotic shock and 
purified by affinity chromatography on lysozyme-Sepharose. as described 
previously (17). Prolein concentrations were determined spectrophoto- 
metrically, using calculated extinction coefficients (19). 

Crystal growth 

Crystals of ihe four mutant complexes were grown in the same way as the 
nalive complex crystals (17). Each of the HuLys Fv solutions was mixed 
wilh a lysozyme solution in equimolar proportions. The mixtures then sat 
from several hours to 2 days. PBS was added to dilute the solution, which 
was cemrifuged before use. Protein concentrations ranged from 6.5 to 10.5 
mg/ml. The reservoir for vapor diffusion was 0.8 M K 2 HPO„, 0.8 M 

0022-1 767/0 1/S02.00 
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Structure 



H27S Complex 



H7IV Complex 



H71A Complex 



Space group 
Fv/asymmetric unit 
Cell dimensions (A) 
Resolution (A) 
Measured reflections 
Unique reflections 
Completeness (%) 
« value" 
Average lAr, 



P4,2,2 
2 

a=n=97.9; r= 173.3 
50.0-2.7 2.75-2.70 



132.657 

23.047 

96.3 

0.065 

10.1 



>2S80 

1060 

90.3 

0.315 

2.6 



P4,2,2 
2 

=6=96.3: c= 175.7 
50.0-2.7 2.75-2.70 



141.035 

22.318 

95.1 

0.067 

18.1 



>3001 

1133 

97.5 

0,404 

2.3 



P4,2,2 
2 

o=/>=97.6; r= 174.1 
50.0-2.7 2.75-2.70 



108,158 

21.229 

88.9 

0.059 

17.4 



>227l 
1044 

90.5 

0.310 

2.6 



H7IR Complex 



H7IV 



P4,2,2 
2 

a=o=97.l; c= 174.8 
50.0-2.7 2.75-2.70 



141,201 

22.707 

95.6 

0.065 

16.5 



>2672 

1025 

89.0 

0.376 

2.3 



50.0-2. 

88.937 

16.6)8 

92.3 

0.075 

17.2 



P4,2,2 
2 

146.8; f =7l.9 
9 2.95-2.90 

> 1849 

751 

87.0 

0.403 

2.5 



NaHjPOj, 0.1 M HEPES. pH 6.5. Silling drops consisting of equal vol- 
umes of complex solution and reservoir solution were set up in 
microbridges. 

Crystals of the uncomplexcd H7IV Fv were grown by macroseeding. 
The seeds were obtained from a hanging drop vapor diffusion crystalliza- 
tion that used 16 mg/ml protein and a reservoir of 0.74 M sodium citrate. 
0.01% NaN v pH 6.5. Two rounds of seeding were performed. Each lime, 
a few crystals were removed from the drop and placed in a microbridge in 
a fresh drop composed of equal volumes of Fv solution (16 mg/ml) and 
reservoir solution (0.8 M sodium citrate, 0.01% NaNj, pH 6.5). 

Data collection 

X-ray diffraction data seis were collected from single crystals at 4°C using 
an R axis detector. The data sets were processed with DENZO and SCALE- 
PACK (20, 21). Details of the processing are given in Table I. Before 
refinement, the data sets were partitioned into a working set and a lesl set. 
The lest sets for the complexes contained only reflections that had made up 
the lesl sei for the refinement of the native complex structure, so as to 
maintain the independence of the test set (22). The lesl set for the uncom- 
plexed Fv was created by X-PLOR (23). as the refinement of the native Fv 
structure did not involve a lest set. 

Refinement 

Refinement of the structure of the HuLys H27S Fv-lysozymc complex 
began wiUi the model of the native complex with residue H27 changed 10 
Gly. A round of rigid body refinement at 3.5- A resolution was followed hy 
rounds of positional refinement at 2.7-A resolution using X-PLOR and 
model building of the loop containing the mutation. A cycle of torsion 
angle molecular dynamics refinement was run, followed by more rounds of 
positional refinement and model building. Omit map density was sufficient 
to model only one of the two H27 side chains. The refinement was com- 
pleted with a cycle of individual B value refinement with TNT (24) and a 
cycle of X-PLOR B value refinement. Refinement statistics are given in 
Table II. 

Refinement of the structures or the HuLys H7IV. H7IA, and H7IR 
Fv-lysozyme complexes was more straightforward. The starling model was 



the native compjex with H7I changed to Gly. A round of rigid body re- 
finemeni at 3.5-A resolution was followed by a cycle of positional refine- 
ment at 2.7-A resolution, addition of the H7 1 side chains to the model, and 
a second cycle of positional refinement. The refinement was completed 
with one cycle each or individual B value refinement with TNT and X- 
PLOR (Table II>. Manual changes or the model, other than placement of 
the H7I side chain, were needed only for the H7IA complex. 

Refinement of the structure or the uncomplexed HuLys H7I V Fv began 
wilh H7I changed to Gly. First, a round or rigid body refinement was 
conducted at 3.5-A resolution. Next tame two rounds or positional refine- 
ment at 2.9 A. alternating with model-building and addition or the H7 1 side 
chains. Group B values (I B per residue) were refined with X-PLOR, and 
a final cycle of positional refinement was performed (Table II). 

No solvent molecules are present in any of the models. PROCHECK 
(25) analyses of Ihe five structures show no residues in disallowed regions 
other than L5 1 . which is in a y-tum conformation, as seen in the native and 
olher related structures (26). 

Results 

H27S structure 

The structure of ihe HuLys Fv mutant H27S was determined as a 
lysozyme complex in a crystal form identical with the complex 
structure obtained previously (17). The crystallography asymmet- 
ric unit contains two Fv:Ag complexes, which we designate mol- 
ecule I and molecule 2. Both Fvs superpose well on the corre- 
sponding Fvs of the H27F structure, wilh rool mean square (rms) 
differences in Ca position of 0.5 A for each of the two complexes. 
Despite these idenlical rms differences, two different conforma- 
tions are present in the two cryslallographically independent H27S 
molecules. Comparing molecule 1 of H27F and H27S. differences 
in Ca position of up lo 2.7 A occur at residues H23-H29, adjacent 
lo CDR-H1. The overall effect is that in ihe H27S structure, this 
portion of Ihe molecule has moved away from the position of the 



Table II. Refinement 



Parameter 



:7S Complex 


H71V Complex 


H7IA Complex 


H7IR Complex 


H71V 


10.0-2.7 


10.0-2.7 


10.0-2.7 


10.0-2.7 


10.0-2.9 


20.005 


19.501 


18.592 


19,535 


14,455 


18,105 


17.642 


16.805 


17,653 


13.014 


1.900 


1.859 


1.787 


1.882 


1,441 


5,478 


5.486 


5,482 


5.494 


3,484 


0.203 


0.202 


0.202 


0.207 


0.225 


0.313 


0.291 


0.291 


0.297 


0.279 


0.015 


0.014 


0.015 


0.014 


0.021 


1.8 • 


1.8 


1.9 


1.8 


2.5 


80.9 


81.5 


80.5 


78.8 


78.6 


0.33 


0.34 


0.33 


0.34 


0.37 



Resolution (A) 
Reflections 

Total (F>2o-) 

Working set 

Test set 
Atoms 
R value" 

Working 

Free 

rms deviation from ideality 

Bond lengths (A) 

Bond angles f°) 
PROCHECK analysis 

% in most favored regions 
Estimated error in atomic position (A)* 



" " - Ski I If'*™ I - I FtlllCH, I I / fll I f-A'Mdl. 

* Calculated by melhod of Luzzali (27). 
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FIGURE I. Stniciutal effects of different amino 
acids al position H27, molecule I. Stereo view of 
H chain CDR I and adjacent peptide segment in 
H27F (black) and H27S (gray). Atomic coordi- 
nates were taken from molecule I from the H27F 
and H27S Fv-lysozyme complex structures. The 
entire H27S-Iysozyme complex was superposed 
on the H27F-lysozyme complex. The superposed 
molecules were used for this illustration. Only the 
peptide backbone from residues H22 to H35 and 
the side chains from residues H27 and H28 were 
drawn, to make clear the conformational changes 
that occur when Ser or Phc is substituted at posi- 
tion H27. Residue H27 in the H27S mutant was 
modeled as Oly. 



M s 
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Phe side chain present in H27F, toward the H chain N terminus and 
lysozyme. creating a more open loop (Fig. 1). Residues H74-H76, 
which pack against CDR-HI, have moved into the space created 
by this shift. Eight of the Cor shifts larger than twice the rms dif- 
ference come from residues H23-H29 and H74 and H76. (The 
others are at chain termini or at locations remote from the com- 
bining site.) Modeling of residues H22-H3I was difficult, and the 
side chain at position H27 could not be fit at all. The possibility 
exists that this remodeled region is in more than one conformation. 

H27S molecule 2 shows a clear difference from the correspond- 
ing molecule 2 of the H27F Fv. The Ser and Phe side chains at the 
substitution site point in opposite directions. As evident in Fig. 2, 
the phenyl ring of H27F is buried in the interior of the extended 
loop formed by residues H23-H35. whereas the Ser side chain in 
H27S points to the aqueous exterior. As predicted (4, 9), substi- 
tution of Ser lor Phe has created a cavity. Residue Ser H28 in the 
H27S Fv has shifted so that its main chain and side chain have 
moved into space occupied by the Phe H27 side chain in the H27F 
Fv. This large perturbation in backbone conformation extends for 
several residue positions along the peptide backbone, as is evident 
in Fig. 2. The shifts of the Co: atoms of residues H23-H3 1 account 
for 9 of the 13 shifts greater than twice the rms difference between 
the mutant and native complexes. A shift at H75 accounts for one 
more, and the others are at chain termini. 



Although the conformation of the loop preceding CDR-HI dif- 
fers significantly in H27S and H27F, structural effects on lysozyme 
binding are small. In the D1.3 complex structure, residue H32 of 
CDR-HI makes a weak (3.5 A) direct contact with lysozyme. Res- 
idues H30 and H3I make contact via water molecules (13). In the 
HuLys H27S structure, the distance for die potential direct conlacl 
between H32 and lysozyme is 4.1 A (molecule I) or 4.3 A (mol- 
ecule 2), similar to the 4.0-A conlacl seen in the H27F molecule I 
complex and an increase from the 3.4 A in the H27F molecule 2 
complex, and too large to be important in lysozyme binding (28). 
Due to the resolution of x-ray data for the HuLys complexes, we 
have not modeled water molecules, hence we cannot directly com- 
pare the Fv-lysozyme interactions involving residues H30 and H3 1 
to the corresponding interactions in DI.3. However, we did com- 
pare the positions of the Fv atoms in H27S and H27F involved in 
these contacts, the carbonyl oxygen atoms of H30 and H3I. Both 
these atoms in H27S molecule I have moved 0.8 A from their 
positions in H27F. In H27S molecule 2, the backbone Ccr atoms of 
these residues have moved 1 .9 A (H30) and 1.2 A (H3I) from their 
positions in the H27F complex. The aloms actually forming the 
contacts, H30 O and H31 O, have moved 1.7 A and 0.9 A, re- 
spectively. The size of this shift does not necessarily mean that 
these contacts are broken. The water molecules in the H27S 
complex presumably could shift position to accommodate the new 




FIGURE 2. Structural effects of different 
amino acids al position H27, molecule 2. Stereo 
view of H chain CDR I and adjaceni peptide seg- 
meni in H27F (black) and H27S (gray). The illus- 
uaiion was prepared as for Fig. I, but using 
atomic coordinates from molecule 2 of the H27F 
and H27S Fv-lysozyme complex structures. 
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Table III. Rms changes in Ca position of H71 mutants relative to 
H7IK 



Mutant 




RMS Deviation (A) 


Molecule 1 


Molecule 2 


H7IV 


0.3 


0.3 


H71A 


0.2 


0.2 


H7IR 


0.1 


0.1 



positions of Ihe protein atoms. The remainder of CDR-H I in H27S 
is offset from its location in H27F, with Ihe respective chains back 
in register by residue H35, Ihe last residue in the CDR. 

H7I structures 

The size of Ihe side chain at position H7I is thought to control the 
relative disposition of loops forming CDR-H I and CDR-H2 (29). 
Previously published structures of free HuLys Fv and the HuLys- 
Iysozyme complex had Lys in this position. Here we report addi- 
tional structures with Val. Ala, and Arg at residue H7I. All three 
forms crystallized and were determined as an Fv-lysozyme com- 
plex, and a structure of the free H7IV Fv was obtained as well. 

AH the Fv-lysozyme complexes were virtually identical. Super- 
position of Ihe Ca atoms of the mutant complexes onto the H71K 
complex gave small rms differences of 0.3 A or less, as presented 
in Table III. Twelve Ca atoms in the two H7I V molecules have 
shifts greater than twice the rms differences, and none are near Ihe 
combining site. Comparing the structures of the H7IA and H7IK 
complexes, four Ca atoms have shifts greater than twice the rms 
difference; three are in the L chain and one is in lysozyme. All are 
remote from the combining site. The most conservative H7I sub- 
stitution, arginine for lysine, gave the smallest overall rms differ- 
ence. However, as for the other H7I mutants, there were moderate 
shifts of Ihe mutated residue and residues in the nearby segment of 
polypeptide chain. The Ca atoms of H71 in molecules I and 2 
moved 0.5 A and 0.3 A, respectively, and the preceding Ca aloms 
in molecule I, H69 and H70, moved 0.2 A and 0.4 A. All other 
shifts greater than twice Ihe rms distance occurred distant from 
H71 and from the combining site. Fig. 3 shows superposition of 
H71 and parts of CDR-H 1 and CDR-H2 for the four molecules, 
taken from the complexed crystal forms. This illustration shows 
clearly (hat there is no change in structure of the two CDRs. de- 
spite the mutations at H71. 




FIGURE 3. Structure of residue H7I and first and second hyDervariable 
loops in four lysozyme-Fv complexes. Conformations of these residues in 
the two crystallographically independent asymmetric units of all four mu- 
tants are essentially identical. This illustration is a composite of superposed 
molecule 2s seen in H7IK (Mack line), H7IV (gray line), H71A I dolled 
line), and H7IR (dashed line). Superpositions were based on H7IK mol- 
ecule 2 and used the Ca atomic coordinates of the residues shown. 




FIGURE 4. Suuctures of residue H7 1 and first and second hypervari- 
able loops in unliganded Fv molecules. Top, H7I V molecule (gray line) 1 
superposed on H71K molecule I (black line). Middle, H7IV molecule 2 
(gray) superposed on H7 1 K molecule 2 (black). Bottom, H7IK molecule 2 
(gray) superposed on H7IK molecule I (black). 



The structures of uncomplexed H7IV and H71K offer another 
opportunity to test for a mutation-induced conformation change 
following the Tramontano model. The two unliganded crystal 
forms of H71K and H7I V each have two molecules in the asym- 
metric unit, hence comprise a total of four independent Fv struc- 
tures. Molecule 1 of H71K and molecule 1 of H7IV superpose 
almost exactly in the region of the mutation, as evident in Fig. 4, 
top. Molecule 2 of H7IK and molecule 2 of H7 1 V superpose sim- 
ilarly well (Fig. 4, middle). However, these two pairs represent 
distinct conformations. The two independent molecules of H71K 
do not superpose well (Fig. 4, bottom), and the same is true for 
molecule 1 and molecule 2 of H71V. In other words, two Fvs 
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differing al residue H7I, bul in identical crystal packing environ- 
ments, are closer in conformation than two with the same se- 
quence, but in different environments. The two unligandcd con- 
formations observed presumably arc distinct because of crystal 
packing interactions, rather than amino acid sequence differences 
at residue 71. The conformation of H7I and the two loops in the 
H7 1 K and H7 1 V Fv-lysozyme complexes is intermediate between 
the two conformations in the unligandcd structures, though closer 
to molecule 2 (rms difference 0.2 A for the Cat atoms in the illus- 
tration superposed on molecule 2 of the H7IK complex) than to 
molecule 1 (0.4 A rms difference). 

Discussion 

The role of residues H26-H30 in Ag binding by humanized Abs 
has been ambiguous. This segment is not considered part of "Ra- 
bat" CDR-HI (residues H3I-H35), and these residues rarely con- 
tact Ag (30). Other homology- and structure-based definitions of 
the first Ig H chain CDR have similarly designated residues outside 
this segment, viz H3I-H37 (31). H3I-H32 (32), and H30-H35 
(33). One exception is the canonical H chain hypervariablc loop I 
proposed by Chothia and Lesk (34). extending from H26 to H32. 
The rationale lor this assignment was that the segment forms a 
loop connecting two /3-slrands of rather standard geometry. The 
conservation of particular features in the N-terminal portion Df the 
segment, such as an invariant Gly al position H26, was considered 
critical for maintaining the backbone conformation of the Ag-con- 
lacling C-terminal portion of the H26-H32 loop. 

How are conformational changes in CDR-HI transmitted from 
the H26-H30 region? Comparison of side-by-side crystal struc- 
tures of mouse and humanized versions of the same Ab would 
seem a straightforward way to discover this mechanism, as iden- 
tical CDR-H I sequences are abutted in the two cases to H26-H30 
regions of separate murine and human origin. However, existing 
structural data on humanized Abs have been equivocal. 

The canonical H26-H32 structure, which the vast majority of 
Abs adopt (35), is typified by the human H chain NEWM (14, 34). 
The rat anti-CD52 Ab CAMPATH- 1 G, with H26-H30 sequence 
GFTFT, follows this canonical structure precisely (36). The initial 
humanized form, though based on NEWM frameworks, sequence 
GSTFS, bound Ag poorly, and probably did not adopt a canonical 
conformation. The crystallographically studied humanized form, 
CAMPATH- 1 H, had higher affinity by virtue of the H26-H30 re- 
gion being reverted to the rat sequence. Nevertheless, this structure 
still differed from the canonical conformation at residues H29 and 
H30. This deviation was attributed to a different interaction with 
respective side chains al position H7 1 (Arg in CAMPATH- IG, Val 
in CAMPATH- 1 H). A recenl structure of the same humanized 
molecule in complex with an Ag mimotope showed that the H26- 
H32 loop was once again in the canonical conformation (37). 

CDR-HI of the murine anti-7-IFN Ab AF2 deviates in confor- 
mation from the canonical structure al each position, but is still 
topological^ recognizable as a loop (38). In contrast, the human- 
ized version of AF2, despite having an identical sequence from 
residue H I9-H66, has an a-helical CDR-H I not seen in any other 
Ab structure. This unique conformation was attributed to a second 
structural rearrangement in framework I associated with a Pro 
(mouse) to Ser (humanized) mutation al position H7. 

The murine anti-lysozyme Ab DI.3 has a canonical CDR- HI 
structure (II). The humanized version of D 1 .3 whose structure we 
previously reported (16, 17) has an identical sequence from H26- 
H35 (7) (H26-H30 sequence GFSLT) and also adopts a canonical 
CDR-HI conformation. A kinetic study of HuLys mutants showed 
that a Ser substitution at residue H27 had only a slightly detrimen- 
tal effect on Ag affinity (9). This observation was contrary to the 



HUMANIZED ANTPOTSOZYME CRYSTAL STRUCTURES 

profound effect of a Ser-to-Phe mutation in CAMPATH- 1 H, even 
though both HuLys and CAMPATH- 1 H used NEWM framework 
sequences (4). One possible explanation is that the mutation in 
HuLys caused no significant structural change. The finding that 
CDR-HI of D1.3 contributes little free energy toward iysozyme 
binding (39) makes plausible an alternative possibility, that the 
mutation did cause a change in residues H26-H30. but this per- 
turbation was not delectable by kinetic analysis. Crystallographic 
data presented here favor the latter proposition, made clear in Fig. 
I. The HuLys H27S structure shows large changes in backbone 
conformaiion in residues H22-H30 in molecule I and H26-H30 in 
molecule 2. bul these torsional changes are not transmitted to the 
nearby Ag binding residues H31 and H32. Translalional changes 
are also not transmitted to these residues, except for a displacement 
of H3I in molecule 2. Given our findings and the apparent idio- 
syncrasies observed in other humanized Ab structures, we can only 
conclude that the conformaiion of CDR-H I and the adjacent H26- 
H30 region are cxlremely sensitive both to their own sequences 
and to interactions wilh adjacenl residues. Our understanding of 
structural determinants of H26-H35 and our ability to rationally 
manipulate this region remain limited. 

Tramonlano et al. (29) have articulated a descriptive and pre- 
dictive model for the structures of the H chain hypervariable loops 
1 (Kabal residues H26-H32) and 2 (Rabat residues H52a/53- 
H55). In this model, the most important determinants of ihe con- 
formaiion of hypervariable loop 2 are Ihe length of the loop and 
specific sequence constraints, wilh particular canonical structures 
and conserved residues expected for 3. 4, and 6 residue loops. A 
further structural determinant is Ihe side chain of residue H7I, 
which is significant in Ihe following way. The position of hyper- 
variable loop 1 is essentially fixed. The position of loop 2 relative 
10 loop I is variable, and depends on whether a large side chain at 
H7I packs between the two loops and separates them or a small 
side chain al H71 allows loops 1 and 2 to juxtapose. 

In HuLys crystal structures wilh four different side chains al 
residue H7I. ihe expected conformational rearrangement of the 
hypervariable loop 2 region is nol observed. The absence of a 
mulalion-induced conformation change cannot simply be due to 
the stabilizing effect of a bound Ag, because the Lys-10-Val mu- 
lation in the unligandcd crystal forms also does nol alter Ihe po- 
sition of loop 2. The modes! (0.4-0.6 kcal/mol) improvement in 
affinity that accompanied this mutation thus cannot be attributed to 
relieving an inappropriate displacement of hypervariable loop 2 
(9). Our findings do nol invalidate the Tramonlano model, for 
which other proof exists, including a specific mutational study of 
residue H7I in Ihe cryslallographically determined Ab B72.3 (40). 
Our data do demonslrate that a class of exceplions may exisi in 
which the H71 side chain alone does nol affect the separation of 
hypervariable loops I and 2. An unknown sequence determinant 
may override the action of H7 1 , or the compact nature of 3-residue 
hypervariable loops (HS3-H55) may confer less sensitivity to ihe 
bulk of the H71 side chain. 

The observation thai significant conformational changes in the 
H27S mutant did nol lead lo much change in Ag affinity, whereas 
substitutions at H7I gave affinity differences, but no apparenl ex- 
plicatory change in structure illustrates the value of combining 
structural and kinetic studies. 
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Summary 

During a germinal center reaction, random mutations are introduced into immunoglobulin V 
genes to Increase the affinity of antibody molecules and to further diversify the B cell reper- 
toire. Antigen-directed selection of B cell clones that generate high affinity surface Ig results in 
the affinity maturation or the antibody response. The mutations of Ig genes are typically base- 
pair substitutions, although DNA insertions and deletions have been reported to occur at a low 
frequency. In this study, we describe five insertion and four deletion events in otherwise so- 
matically mutated V H gene cDNA molecules. Two of these insertions and all four deletions 
were obtained through the sequencing of 395 cDNA clones (~1 10,000 nucleotides) from 
CD38 + IgD~ germinal center, and CD38"IgD~ memory B cell populations from a single hu- 
man tonsil. No germline genes that could have encoded these six cDNA clones were found af- 
ter an extensive characterization of the genomic V H 4 repertoire of the tonsil donor. These six 
insertions or deletions and three additional insertion events isolated from other sources oc- 
curred as triplets or multiples thereof, leaving the transcripts in frame. Additionally, 8 of 9 of 
these events occurred in the CDR1 or CDR2, following a pattern consistent with selection, 
and making it unlikely that these events were artifacts of the experimental system. The lack of 
similar Instances in unmutated lgD*CD38" follicular mantle cDNA clones statistically associ- 
ates these events to the somatic hypermutation process (P = 0.014). Close scrutiny of the 9 in- 
sertion/deletion events reported here, and or 25 additional insertions or deletions collected from 
the literature, suggest that secondary structural elements in the DNA sequences capable of pro- 
ducing loop intermediates may be a prerequisite in most instances. Furthermore, these events 
most frequently involve sequence motifs resembling known intrinsic hotspots of somatic hy- 
permutation. These insertion/deletion events are consistent with models of somatic hypermu- 
tation involving an unstable polymerase enzyme complex lacking proofreading capabilities, and 
suggest a downregulation or alteration or DNA repair at the V locus during the hypermutation 
process. 



During the course of a T cell-dependent antibody re- 
sponse, B cells hone the specificity of their antibody 
molecules through a process oi random somatic hypermu- 
tation of their V genes, followed by antigen driven selec- 
tion. This is collectively referred to as affinity maturation. 
This process occurs within the germinal centers (GCs) 1 of 
secondary follicles from peripheral lymphoid organs when 



1 Abbreviations used in this paper: FM. follicular mantle: FW. framework; GC, 
germinal center. 



antigen stimulated B cells receive proper signals from T and 
accessory cells. In the human system, GC B cells are char- 
acterized by the surface expression of CD38 and, in most 
cases, the loss of IgD (1-3). We have previously shown that 
the initiation of somatic hypermutation occurs within the 
CD77 + subset of these IgD-CD38-> B cells (4). Mutated V 
genes can be isolated from all subsequent stages of B cell 
differentiation and in cells from all IgD" and certain lgD + 
B cell subsets (4, 5). The molecular process of somatic hy- 
permutation remains elusive, primarily due to the lack of a 
good In vitro model until very recently (6). Much of what 
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is known concerns: (a) localizing the somatic hypermuta- 
tion process to particular B cell subsets and anatomical set- 
tings (4, 7-10); (6) delineating the limits and rates of muta- 
tional activity (II); (c) determining the minimal substrate 
through transgenic technology (12, 13); and (d) analyzing 
the mutations themselves in the context of the surrounding 
sequence to reveal tendencies such as strand polarity and 
"hotspots" of somatic hypermutation (for reviews see refer- 
ences 12 and 13). 

Although somatic hypermutation is typically described as 
the generation of bp substitutions, insertions and deletions 
have been sporadically described. As with somatic point 
mutations, the analysis of these events can provide valuable 
information concerning somatic hypermutation itself. Anal- 
ysis of human V H 4 family genes generated from the ampli- 
fication of cDNA from somatically mutated GC (IgD" 
CD38+) and memory (IgD"CD38-) B cell subpopulations 
led us to identify a number of cDNA clones from the mu- 
tated cell populations that contained Insertions and dele- 
tions. We provide evidence that these events are linked to 
the somatic hypermutation process. Additionally, these 
events occur in a predictable fashion relative to the sur- 
rounding sequence, suggesting a model for their occur- 
rence with implications for the molecular process of so- 
matic hypermutation. 



Materials and Methods 

Isolation. Labeling, and Sorting of Tonsil B Cells. Human tonsils 
were obtained during routine tonsillectomy. B cell Isolation and 
sorting for CD38 and IgD expression were performed as previ- 
ously described (4, 1 4). In brief, human tonsillar B cells were sep- 
arated inco lgD*CD38" follicular mantle (FM) B cells, IgD" 
CD38* GC B cells, and IgD"CD38- memory B cells to 95-98% 
purity as predicted by FACS" analysis, as previously described (13). 
The mutation state of the V H gene cDNA clones from the various 
subpopulations was in agreement with our previous study (4). 
Clones were considered somatically mutated If they contained two 
or more bp substitutions, well beyond the expected error rates for 
the avian myeloblastosis virus reverse transcriptase (AMV-RT). 
Taq, and PFU polymerases used in these analyses (this mutation 
rate is based on our previous analyses; reference 4). 

Sequencing the Ig V H Transcripts. Total RNA was extracted 
from 1-5 X 10 s B cells using guanidinium thiocyanate-phenol- 
chloroform in a single step using the Ultraspec RNA isolation 
system (BIOTECX Laboratories, Houston. TX). and was reverse 
transcribed using oligo-d(T) or specific V gene constant region 
oligonucleotides Cu.12 (5'-CTCGACTTTGCACACCAC- 
GTC-3') for IgM transcripts or C7I8O (5'-CTGCTGACC- 
GAGTAGAGTCC-3) for IgG transcripts and SuperScript II reverse 
transcriptase (CIBCO BRL. Galihersburg, MD). First strand cDNA 
was used directly for second strand synthesis and amplification via 
PCR using internal primers corresponding to the Cu. or C7 con- 
stant regions in combination with V H 4 or V H 6 family-specific 
leader oligonucleotides: C-yl40. 5'-GCCAAGGTGTGCACGCC- 
GCTG-3'; Cu.10, 5'-TCTGTCCC CTGCATGACCTC-3': L-4. 
5 ' - ATC AAACAC CTGTG GTTCTT-3' ; L-6, 5'-ATGTCTGT- 
CTCCTTCCTCAT-3'. The PCR products were purified using 
microconcentrators (Amlcon, Beverly, MA), and then were ki- 
nased and blunt-end ligated into an EcoRV-dlgested and dephos- 



phorylated pBluescript plasmid (Stratagene, La Jolla, CA; Polynu- 
cleotide Kinase, T4 DNA Ligase, and EcoRV were from Boehringer 
Mannheim, Amsterdam, Netherlands). After transformation by 
electroporation Into electro-competent DHlOa Escherichia coli 
(CIBCO BRL) and screening with consensus Internal oligonu- 
cleotides as previously described (4. 15), positive colonies were 
picked, plasmid mini-preparations were made, and colonies were 
sequenced in both directions using an automated DNA sequencer 
and automated sequencer protocol (ABI-377; Advanced Biotech- 
nologies Inc.. Columbia, MD). All sequences were analyzed us- 
ing DNAstar (DNAstar Inc.. Madison, WI). In the first tonsil an- 
alyzed, 583 clones were picked, plasmid mini-preparations were 
made, and Southern blots were prepared by standard methods. 
These blots were screened with a set of oligonucleotides specific 
for the various V H 4 family genes. Only those clones that screened 
positive wid> constant region probes but negative for the various 
V H 4 complementarity-determining region (CDR)l-speclfic probes 
were sequenced (395 of 583 clones), thus enriching the somati- 
cally mutated populations analyzed, in that the CDR1 probes 
should anneal only to the sequences most similar to germllne. 
The frequency of the occurrence of these events can therefore 
only be predicted to be between 6 out of 395 and 6 out of 583 
clones (1-2%). Any sequence of interest was resequenced In both 
directions to ensure sequence fidelity. 

Characterizing the Genomic Repenolre. Total genomic DNA was 
Isolated from FM B cells (IgD*. CD38") using the Puregene DNA 
Isolation kit (Centra Systems, Inc., Minneapolis, MN). V H 4 genes 
were amplified using a V H 4 leader-specific primer (L-4, as above) 
and a primer specific for all V H 4 gene family heptamer-nonamer 
spacer regions as previously described (16). PCR products were aga- 
rose gel purified, then cloned into E. coll as described above for 
the cDNA clones. Clones identified in the cDNA analysis that con- 
tained insertions or deletions were used to design PCR primers to 
amplify both the exact sequence of clones with Insertions/dele- 
tions as found and the predicted sequences based on the proposed 
germline counterparts. Oligonucleotides used in this analysis 
(Format. Is as follows: clone: exact/predicted): g64:5'-GCACGC- 
CTTGTACTTGGTTCC-375'-CCACCGCTTCTACCTC- 
TCC-3'; gl44:5'-TCTTGAGGGACGGGTTGGTGT-375'- 
TCTTCAGCGACGGGTTGT-3'; gl87:5'-CAGCTCCAGTAG- 
TAAGCCCCC-375'-CAGCTCCAGTAGTAACCACCG; E 188: 
5'-CACCGATTCTACTTGGAGCC-375'-GAGGCGTTGT- 
AGTTCCTCCC; g 1 92:5' -CCAGCCCCAGTAGTAGTAACT- 
37(same); and g 80:5'-CCGGATCCAATACCTCACACT-37 
5'-GCCCATCCACTAGTAACC-3'. 

Sequence Availability. All cDNA sequences with insertions or 
deletions, and any genomic sequences unique to the literature as de- 
scribed In the results section are available from EMBL/Cenbank/ 
DDBJ under accession numbers AF013615 through AF013626. 

Assay for Screening V H Gene Lengths. To facilitate the analysis 
of large numbers of Vh gene transcripts for the presence of Inser- 
tions or deletions, first strand cDNA produced as described above 
was PCR amplified using Expand high fidelity polymerase (Boeh- 
ringer Mannheim) to reduce errors resulting from Taq poly- 
merase alone. The producls of this PCR amplification were cloned 
as described above and screened using 3z P-labeled, gene-specific oli- 
gonucleotides (V H 4 -39: 5' - ATTC GG AGTATCTATTATAGT-3 ' ; 
L-6 as above). Positive colonies were picked and used to inocu- 
late overnight cultures. A 1 pi aliquot from each 24-h culture was 
used to directly inoculate 25-u.l PCR amplification mixtures in 
96-well-rormat PCRs. The internal PCR reactions used 32 P-labeled. 
gene-specific oligonucleotides to amplify a 230-base fragment in- 
cluding the V H 4-39 CDRI (L-4. as above, and V H 4-39-3': 5'- 
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A) Insertion events from a single tonsil' 



9-39 0L0LQESCSGLVItP«tTLSLTCtVS«:fiISS£3YWGWIR0?PGKCLEUICSITYSCSTYY HPS WSKVTI SVOTSKNOrSL«L£SVT!UUyillVYYCA» 
«H* — R G t-S V L S — T T H If 

CPP.I CPU 
4-39 OLQLOt8CPCI.VKP8ETtSI.TCTVBGaSlSSG0 YYWCWIRCPHIKCUHI&SrYYSSSTYYMPSLKSfcVTIBVOTSKMOrflUiLifiVTJUUJTXVYYCAJl 
yl93 ----K »---*- -A — -Y V 



B) Deletion events from a single tonsil: 



«-S» OV010S«POLVKP3r.7LaLTCTV30CSV«3YYW3WIIlOPPGkCLKW10YIYYWmY>JPSLMIlVTleV^SWrstItWSVTJUUyTAVTYCAP. 
g!99 1 1 L-fi V--K-E S — S — T P. S- 



*-3l OVOLOteGPCLVKPtXJTLSLTCTVSWElSS^^^UlJHKWGLeUIGYfYTKij^mlpctREI 

gl" —« 1— -AfU— 



ICAVTI SVWSlQiOrEUCLSSVTAAIYFAVYYCA 
A-T-C— I H-l-V-LY— WH— r-»-G— 



,C0«,1_ ... 
*-3! OVvLOESGPCEVKPS v TLSLTCTVSGGSISS^YYWSE'IP«HPGKGL^ 

9" « A" » t IK 9 «-T AGPf • ■ 



9-M QVCLC^AGLLKPSITLSLTCAVY«;srsGYYVPM3AvPPGKGU*lC*IHK^ 

C) Other insertions: 

V„E VOLOQflGPGWXPSCrtSL-fCAlSGDSVSS HEUbAVWIROSPSAGLEWUiATYYUKWY^rYAVfiVKSKITIHPDTSXMQffSLCLMSVYPEDTAVYYCAA 



V.<-31 UQWJIBCPGtVKPEOTlELTCTVSGGSI ECCYYWEW19C«PGItCHUIGYI YYb'CETYYNPE LK£*VTlSVDTBlWOPELKt.SEVrAADTAVYY 
pg8« -P O-FTEGDBf f - L - i. H-C-- -G-D 1.-HTI - 



VQI^ECPGWKPSOTLSLTCAIECDSVSSMBMWNWIEDEPSRGUEWWATTYEEKWYHOYAVSVXSBITIHPDTSKM QFELOmsVYPEOTAVYYCAX 



Figure 1. Predicted amino 
acid sequences of nirte cDNA 
clones wilh insertions or dele- 
lions. Note that all of these clones 
are extensively mutated, in all but 
clone tm 121. the insertions or de- 
letions occur in CDR1 orCDR2. 
(A and Bj Two clones with inser- 
tions (-4) and four clones with de- 
letions {Bj from a single tonsil. (Q 
Three additional cDNA clones 
with insertions isolated from vari- 
ous sources. Sequence data avail- 
able from CenBank/DDBJ under 
accession numbers AF0I3615 
through AF0I3S26. 



GCTCCCACTATAATAGATACT-3') or for analysis of V„6 
genes a 166-nucleotide fragment including the CDRI and CDR2 
or V H 6 (V H 6FW1: 5'-TGCCATCTCCGCGGACACTGT-3'. 
V H 6FW3: 5'-TGTGTCTGGGTTGATCGTTAT-3'). Allquots 
of each clone were also used to inoculate amplifications of CDR3 
regions using FW3-speclfic (ssFW3: 5'-CTGAA|C/C]CT- 
GAGCTCTCTGAC|T/C1) and Cu,- or C'y-speclfic oligonucle- 
otides (Cp-D: 5' -GCAATTCTCACAGGAGACGA-3', C-/-140 
as above) to analyze the diversity of the populations under study; 
the distribution of CDR3 size variations of several hundred V H 
sequences cloned in this analysis were used to produce an ex- 
pected distribution of CDR3 sizes for comparison (see Fig. 5 B). 
The amplification products were eleclrophoresed on 0.6X-TBE. 
5% urea-acrylamide sequencing gels (Long Ranger: J.T. Baker. 
Philllp5burg, NJ) and analyzed with a Phosphorlmager (Molecu- 
lar Dynamics. Sunnyvale. CA.) using the Image Quant software 
supplied by the manufacturer. Clones that differed from the ex- 
pected size and those clones in lanes adjacent to aberrantly mi- 
grated bands were used to produce plasmid preparations from 
which the inserts were sequenced in either direction. 

Spring of Insertion/Deletion Events. In the results section. In- 
sertions and deletions are scored as events per 10/ nucleotides 
within the customary boundaries of CDRI and CDR2. This unit 
was chosen because in the selected populations studied these events 
are generally only found in the CDR regions and therefore the 
comparison of events per total nucleotides would be misleading. 
In the PAGE analysis, each V H 4-39 FM clone included only the 
CDRI (21 nucleotides) within a total of 230 nucleotides/clone, 
whereas each V H 6 FM clone was only 166 nucleotides but in- 
cluded both the CDRI and CDR2 (75 CDR nucleotides). In the 
sequencing analysis, various B cell populations were analyzed In- 
volving a wide range of overall lengths. Comparisons of the fre- 
quency of insertions/deletions just within the CDRs allowed for 
a more standardized and quantitative analysis, and for more free- 
dom in experimental design. 

Baculovirus Expression System. Cloning and coexpression of 
clone pg86 and k light chain FS6k in the baculovirus expression 



system was performed as previously described (17). Recombinant 
Autographs California nuclear polyhedrosls virus (AcMNPV) was 
cloned using the pH360NX transfer vector and expressed In Sf9 
cells. 

Capture EL1SA for y Heavy Chain, and k Light Chains. Expres- 
sion of recombinant antibodies of clone pg86 coexpressed with K 
light chain FS6k were measured by capture ELISA. Wells were 
coaled with goat anti-human IgG and incubated with supernatant 
of recombinant pg86/FS6K added in serial twofold dilutions. 
Bound antibody was detected using alkaline phosphbiase-conju- 
gated goat anti-human IgG. or goal anti-human Ck. After I -h 
incubation at 37°C. phosphatase substrate was added and absor- 
bance was measured at 405 nm In an ELISA plate reader. 

Results 

Insertions and Deletions into Immunoglobulin V H Genes. 
In a large scale analysis of V H genes from both the IgM and 
IgG compartments of B cell subpopulations separated from 
a single human tonsil, six clones that contained DNA inser- 
tions or deletions were isolated. These insertions and dele- 
tions were apparently selected in that they involved nucle- 
otide triplets or multiples of nucleotide triplets, leaving the 
cDNAs (transcripts) in frame, and they were localized to the 
CDRI and CDR2 (Fig. 1, A and Bj. The six clones with 
insertions or deletions were identified from the sequencing 
or 395 cDNA clones M 10.000 nucleotides) from GC and 
memory B cell subpopulations, resulting in a frequency of 
<2% of clones analyzed (~1 event/18,000 nucleotides). 
All six events were in IgG transcripts. Two events were ob- 
tained Tram IgD~CD38 + GC and four events from 
lgD~CD38" memory cell populations. None of the IgM Vh 
cDNAs analyzed from this tonsil had insertions or deletions, 
although we have observed such events In IgM transcripts 
in the past and in subsequent analyses, as described below. 
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Figure 2. Comparison of ihe CDRls of the human V H 4 germline 
genes. The primary variability between V H 4 family members Is 3-6-bp 
size variances in the CDRls which Is similar to the short insertions and 
deletions that we artrlbute to somatic hypermutation In the selected B cell 
populations studied in this reporr. 



The Insertions and Deletions Are Not Cermiine Encoded. 
The analysis described above focused on the V H 4 gene 
family, which consists of 10-14 members/genome, varying 
slightly between individuals (16. 18). As shown in Fig. 2. 
the major difference between V H 4 genes involves the length 
of CDR1. Because genomic diversity between Vh4 family 
members resembles the events described in this paper we 
had to rule out possible alternative explanations for these 
events, such as: (a) different alleles of the detected genes; (/>) 
rarely expressed or otherwise unknown V^4 gene family 
members; or (c) hybrids between known and detected V H 
genes and/or other artifacts of the experimental system. To 
address these issues, both the expressed and genomic reper- 
toires from this tonsil were characterized. As indicated in 
Table i, 2 out of 118 V H 4-39. 2 out of 49 V H 4-31, 1 out or 
87 V H 4-34. and 1 out of 45 V H 4-59 cDNA clones contained 
Insertion/deletion events. cDNA clones were Judged as 
unique Isolates based on CDR3 analysis, and the few Iso- 
lates that appeared to be clonally related differed in their 
patterns of somatic mutation beyond the level explainable 
by reverse transcription and PCR errors (maximum: >1 
mutation/500 nucleotides of V H gene sequence as previ- 
ously described 14|). 

To characterize the genomic repertoire of the initial ton- 
sil. 80 germline V H 4 gene clones were isolated and se- 
quenced (Table 1), which encompassed all 14 Vn4 family 
members or alternate alleles represented in the 446 cDNA 
clones analyzed from all of the tonsillar B cell subsets. In 
the course of this study, we isolated the germline counter- 
part of a novel V H 4 gene segment for which transcripts had 
been found. In addition, germline genes corresponding to 
two apparently functional V H 4 genes not found as cDNA 
clones in this analysis were Isolated, as well as one nonfunc- 
tional V H 4 gene and a divergent polymorphism of a known 
V H 4 pseudogene. The proposed germline counterparts of 
each of the V H 4 genes containing insertion/deletion events 
were isolated from 4 to 11 times (Table 1). 8 independent 
genomic isolates of V H 4-31 and of V H 4-39 were cloned. 
V H 4-34 and V H 4-59 were isolated 1 1 and 4 times, respec- 
tively. No germline genes were isolated that could have en- 
coded the insertion/deletion events described. 

To further be certain that the insertion/deletion events 





cDNA 


Total 


Germline 


V H 4 gene 


clones with cDNA clones 


clones 


alleles isolated* 


ins/del 


Isolated 


isolated 1 


V H 4-39 


2 


113 


7 


V„4-3I 


2 


49 


8 


V„4-59 


1 


45 


4 


V H 4-34 


1 


87 


11 


V H 4-34 related 


0 


0 


4 


Vn4-55 pseudogene* 


0 


0 


12 


V H 4-55-relaled pseudogene 5 


0 


0 


3 


V H 4-04 


0 


17 


7 


V H 4-04- related pseudogene 5 


0 


0 


2 


V H 4-6I 


0 


25 


7 


New V H 4 genel 


0 


33 


3 


V„4-04B 


0 


72 


1 


V H 4-28 


0 


0 


1 



'Nomenclature based on Matsuda and Honjo (37). 
'Nine unusual isolates were also cloned consisting of hybrids of two of 
the indicated genes, resumably do to PCR artifact. None of these arti- 
facts were altered In size or resembled any of the insertion or deletion 
events observed. 

s Pseudoge»es contain stop codons or frameshift mutations and are not 
expressed. 

"Newly Identified V H 4 gene is most closely related to V H 4-Q4. 

described herein were not germline encoded, two sets of 
PCR primers were designed lo specifically recognize: (a) 
the exact sequence of ihe events; (fc) the predicted, unmu- 
tated, germline sequence corresponding lo the cDNAs con- 
taining insertion and deletion events. These primers were 
used to amplify genomic DNA from this individual, yield- 
ing negative results (data not shown). The unique nature or 
these events relative to both the expressed and genomic 
repertoire and our inability to amplify genomic counter- 
parts for these events by PCR suggest that they are not germ- 
line encoded. 

The Proposed Insertion/Deletion Events Are Not the Result of 
(Vh/Vh) Recombination. As in most V gene repertoire 
analyses, we detected hybrid V H sequences that could be 
the result of either PCR splicing by overlap extension arti- 
facts, or reciprocal homologous recombination between un- 
rearranged V genes (19). However, none or these likely artl- 
faclual events were altered in size such that they resembled 
the insertion or deletion of DNA described above. A num- 
ber of artifacts of this type had been isolated in the cDNA 
analysis as well; such artifacts are common to V gene analy- 
ses (20). The cDNA isolates with deletion and insertion 
events were stringently compared to all germline and 
cDNA isolates and were found to be unique relative to both 
the expressed and germline V H 4 gene repertoires of this in- 
dividual, supporting a somatic origin for their occurrence. 

77ie Insertions and Deletions Are Associated with Somatic Hy- 
permutation. To determine whether or not these inser- 
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tion/deletlon events were associated with somatic hyper- 
mutation, we analyzed their occurrence in unmutated FM 
transcripts. This was done using either direct sequencing or 
PCR amplification of portions of the V H genes spanning 
the CDRs, followed by size comparisons on polyacryl- 
amide gels (Fig. 3). Any clones that ran aberrantly, and the 
clones in adjacent lanes, were sequenced (75 out of the 485 
clones). None of these 75 clones were related based on 
CDR3 homology. To ensure that the remaining 410 FM 
clones were polyclonal, the CDR3s were PCR amplified 
and loaded on the sequencing gels simultaneously to the 
Vh gene amplification products for size comparisons (Fig. 
3 A). The size distribution of these CDR3s was similar to 
that of ~500 V H gene sequences analyzed in this study 
(Fig. 3 Bj. providing evidence that our FM sample is poly- 
clonal. 

The six events detected from a single tonsil were isolated 
from 395 mutated cDNA clones (25,482 CDR nucleo- 
tides), corresponding to a frequency of 2.35 events/10 4 CDR 
nucleotides. This is significantly different (p=0.014 by a 
one-sided x 2 test) from the analysis of unmutated FM- 
derived clones (25.515 CDR nucleotides) that yielded no 
insertions or deletions (Table 2). 

In the course of the analysis described above, we isolated 
one IgM clone containing a 6-nucleotide insertion into 
framework (FW)3 (see below). We believe that this clone 
is part of the mutated GC or memory repertoire because it 
contained 4 bp substitutions In addition to the Insertion. In 
this study, the B cell populations analyzed were 95-98% 
pure, and the FM B cell subpopulallon could therefore In- 
clude between 2 and 5% contaminating clones, that is, 
lgM-expressing cells not from the naive population that 
can therefore be somatically mutated. However, none of 



Table 2. Analysis of Unmutated FM cDNA Clones for Insertion or Deletion Events 



Clone type 


Clones 
assayed 


CDR 
nucleotides' 


Events 
observed 


Frequency* 


Expected (events/ IV 
CDR nucleotides)' 


Mutated V H 4 clones (GC and 


395 


25.482 


6 


2.35 events/ 




and memory B cells) 








10 4 inn 




Unmutated clones.' 












V„4-FM, CDR1* 


265 


5.565 


0 


0 


1.31 


V H 6 IgM FM V H genes* 


220 


16,500 


0 


0 


3.88 


V H 4 family FM sequences 


51 


3,450 


0 


0 


0.81 


Total unmutated values 




25.515 


0 


0 


2.35 events/10 4 CDR 








(P = 0.014)' 




nucleotides 



'Clones analyzed by hot-PC R/PAGE assay as described in the text. 

1 CDR nucleotides are those within the customary bounds of the CDRI and CDR2. (See Materials and Methods for a more detailed explanaUon of 
this unit). 

'Events per 10* CDR nucleotides. 

'Expected frequency (events/10 4 CDR nucleotides) derived from sequencing data: 6 evenu In 25.482 CDR nucleotides: 6/(25 482 CDR nucle- 
otldes/10 4 ) = 2.35. 

'Statistical analysis: x 2 test for Independence. 
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Figure 3. Polyacrylamide gel assay to Identify Insertions or deletions 
into V H genes. {A) Phosphonmage of a polyacrylamide gel: each lane con- 
lalns the hot-PC R products ( K P-labeled) or the V H gene and the CDRS 
or an individual clone. (S) A comparison of the distribution of CDR3 
sizes of the 485 CDR3s assayed to the disiribution of 500 CDR3s ob- 
served In sequences from this report indicates that the clones assayed by 
electrophoresis were a polyclonal population. CDR3 sizes were measured 
from the most 3' Tyr residue (common [o all V genes analyzed) to Ihe 
most 5' Cp. or C^ residue. CDR3 lengtlis for those assayed by electro- 
phoresis were extrapolated based on sequencing of 75 out of the 485 
clones assayed. The x-axis is the number of amino acids greater than the 
shortest CDR3 round. 
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A Insertion events: 



pgtfi •ntF^amnw tLISA 



V.6: GTC TCT AOC AAC AGT GCT 

HBpr: GTC TCT AGC AOC AAC AGT GCT 



v»4-19: TAC TAC AAC CCG TCC CTC 

gut: TAC TAC AoC AAC CCG TCC CTC 



v,«-39i AGT TAC TAC TGG GGC TGG 

111!: AGT TAC TAC TAC TGG GGC TGG 



v„6: TCC AAG AAC -— — CAG TTC 
1*121: TCC AAG AAC AAG AAC CAG TTC 



B Deletion events: 



Virt-3i! GGG AGC ACC TAC TAC AAC CCG 
g6<: GGa AcC Aag TAC AAC CCG 

v»<-31; TCC ATC AOC AGT GGT GGT TAC 
.3)8-1.- TtC ATC AGC GGG GCT TAC 

v.i-49: TAC AGT GGG AGC ACC AAC TAC 
glBUi TAC AGg GGg tCC AAC TAC 

V»4-J<.- TAC TAC TCG ACC TOO ATC CGC 
qao: agg TAc TGG ATC CGC 



C Long duplication/insertion 



••* com*"." 

CCACTCGTGGmCTACTGG 



Figure 4. The insertions and deletions are related to the surrounding 
DNA sequence. (A) The insertions involve repetitions of the Immediately 
adjacent sequence. (B) The deletions are deletions of tandem repeats. (C) 
The 18-base Insertion in clone pg86 is a duplication of the adjacent se- 
quence. Nucleoiides that mutated before the duplication/insertion are In- 
dicated. Sequence data available from EMBL/GenBank/DDBJ under ac- 
cession numbers AF0I3G15 through AF013626. 



the unmutatcd FM clones analyzed had insertions or dele- 
tions, 

Other Insertions and Deletions into V H Genes. We have ob- 
served similar instances of insertions and deletions into the 
coding regions of apparently functional immunoglobulin V 
genes, including: (a) a V H 6 Igtvf isolate containing a triplet 
duplication/insertion into the CDR1 in addition to several 
bp substitutions (Figs. 1 C and 4 A), which was derived 
from a human hybridoma secreting high affinity rrtAb 
against Bordctella pertussis (21, 22); (6) a 6-nucleotlde inser- 
tion into the FW3 region of a mutated lgfvl Vh6 gene, rep- 
resenting the only insertion or deletion observed outside of 
the CDRs (Figs. I Cand 4 A. clone tml21); and (r) an 18- 
nucleotide duplication/insertion into a human plasma cell 
cDNA transcript at the boundary between the FW1 and 
CDR1 (Figs. 1 C and 4 Q. doubling the length of this hy- 
pervariable loop. The viability of clone pg86 was tested by 
expressing it in the baculovirus system in association with a 
K light chain encoding construct (FS-6k; Fig. 5). The effi- 
cient expression, secretion, and pairing with light chain in 
the baculovirus system suggest that the product of clone 
pg86 is a functional heavy chain despite the large duplica- 
tion/insertion. 

The Insertions and Deletions Are Related to the Surrounding 
Sequence. As shown in Fig. 4. the insertions reported are 
duplications of the immediately adjacent sequence, and the 




pgBS .Ml k.piu, EllSA 




Figure 5. ELISA assays show- 
ing the expression of clone pg86 
with a six amino acid Insertion at 
the FW1/CDRI Junction. 
Clone pg86 (lgC heavy chain) 
was coexpressed with the k light 
chain FS6k in insect cells using 
the baculovirus expression sys- 
lem. F-xpression of pg86 and its 
ability to pair with the FS6k light 
chain was lested using capture 
ELISAs. Wells were coated with 
goat anti-human lgC. Superna- 
tants containing recomblnanl an- 
tibodies were added in serial 
twofold dilutions. Bound antibody 
was delected with phosphalase- 
conjugaled goat anti-human IgG 
(A), and goat ami-human Cx (5). 



deletions involve elements of repetitive tracts. In addition, 
a higher incidence of Ihese events involve sequence motifs 
that resemble intrinsic hotspots or soiralic hypermutation 
(12, 23-27): (a) four of eight events involved the serine 
codon ACC that has been reported as the "hottest" of 
hotspots (24-27) (Fig. 4, sequences HBp2. gl87, g)88, and 
g86); (i) two events involved TAC motifs (Fig. 4, g!92 and 
g64); and (c) (wo events involved the motif AAC (Fig. 1. 
g]44, and tml2l). In general 3)) of the clones found to 
contain insertions and deletions were highly mutated (Fig. 
I). Several of these clones had bp substitutions clustered 
with the insertions or deletions (Figs. 1 and 4). The plasma 
cell transcript depicted in Fig. 4 C contained an 18- nucle- 
otide Insertion that duplicated the 5' adjacent sequence. 
The central nine nucleotides of the duplicated sequence 
form a partial palindrome (..GGtGaCtCC). This clone 
was mutated (G to A at position 80 and an A to T at posi- 
tion 85) before the duplication/insertion event, as these 
mutations were perpetuated in the inserted sequence, 

Discussion 

Somatic modification of V genes encoding immunoglob- 
ulin and T cell receptors recapitulates most mechanisms 
observed In the evolutionary diversification of DNA: (a) V 
gene recombination, including imprecise junctions, P nu- 
cleotides, and untemplated N nucleotide addition; (b) gene 
conversion; and (r) bp substitutions in Ig somatic hypermu- 
tation. The insertion and deletion of nucleotides is another 
means for the evolutionary diversification of DNA, and has 
been proposed as an explanation for unusual V gene se- 
quences in the past (Table 3). In this study, we show that 
insertions and deletions are associated with the somatic hy- 
permutation process. 
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Complexities of the Analysis of Insertions and Deletions into V 
Cenes. The formal characterization of these events has 
been a daunting task because of their low frequency, and the 
complexity of the germline V H repertoire. According to 
our study, these events occur in <2% of somatically mu- 
tated clones. As shown in Fig. 2. the primary variability be- 
tween V|. t 4 family members is 3-6-bp size variances in the 
CDRls. which is comparable to the short insertions and 
deletions that we attribute to somatic hypermutation (in se- 
lected B cell populations). The similarity between evolu- 
tionary diversity and somatic diversification was expected, as 
the molecules are likely subject to the same functional and 
structural constraints. This has made it difficult to determine 
whether these events were generated somatically, versus germ- 
line encoded, or if they were artifacts of the experimental 
system: they could result from homologous recombination 
between alternate alleles or imperfect recombination be- 
tween identical alleles, or they could have occurred during 
B cell replication independent of somatic hypermutation. 
In fact, V H genes may exhibit particularly unstable se- 
quence characteristics evolved to help support both germ- 
line diversity and the generation of somatic mutations, as 
suggested by the identification of intrinsic hotspots of so- 
matic hypermutation within the CDRs of V genes (25, 26). 
Perhaps the area of greatest contention in this complex sys- 
tem remains the possibility that these low frequency events 
are artifacts of trie experimental manipulations performed, 
the AMV-RT, Taq, or PFU polymerases, and/or the clon- 
ing in E. coli. 

The Insertion/Deletion Events Are The Result of the Somatic 
Hypermutation Process. Our system addresses several key 
issues that associate the occurrence of Insertions and dele- 
tions to the somatic hypermutation process, (a) Six of the 
nine insertions/deletions were identified within the V H 4 
gene repertoire of a single tonsil, providing an experimen- 
tal system that could be characterized extensively as de- 
scribed below, (b) All of the insertion/deletion events re- 
ported involved triplets or multiples of triplets, leaving the 
transcripts in frame and therefore functional, and eight of 
nine events reported were localized to the CDRs. As with 
somatic point mutations, no insertions or deletions were 
observed in the 80 to 120 nucleotides of constant region 
(Cu. or C-y) DNA sequenced with each cDNA clone. 
These hallmarks of somatic hypermutation and selection ar- 
gue strongly that these events are not artifacts, (c) The B 
cells analyzed were processed and separated into highly 
pure, mutated B cell populations including GC (IgD - 
CD38"*) and memory (IgD"CD38~) B cells, and an unmu- 
tated FM B cell population (IgD"CD38"). making it possi- 
ble to focus our analysis on the mutated populations and 
use the unmutated population as a negative control, which 
in turn allows the statistical association of the observed in- 
sertion and deletions to the somatic hypermutation process 
(P = 0.014). In addition, the isolation of four of the Inser- 
tion/deletion events from memory B cells provides evi- 
dence that these events did not result from artifacts related 
to contamination from endonucleolytically cleaved DNA 
from the apoptotic GC cells, (d) Seven of nine events re- 



ported in this study involved 7 heavy chains that contain 
nearly twice the mutations of p. heavy chains (4), further 
correlating the events described here to somatic hypermu- 
tation. (e) As discussed below, the insertion/deletion events 
described tended to involve sequence motifs resembling 
previously described hotspots of somatic hypermutation. pro- 
viding evidence that these events occur by the same pro- 
cess. (!) Finally, we extensively analyzed the V H 4 gene fam- 
ily of the tonsil donor at both the expressed and genomic 
levels, facilitating the assignment of the insertions/deletions 
as somatic rather than germline encoded. 6 of the clones 
with insertions and deletions were unique among 395 V H 4 
cDNA clones sequenced from a single tonsil, including many 
independent isolates of each of the Vn4 genes expressed 
(Table 1). In addition, we were unable to isolate genomic 
templates for any of the insertion or deletion events either 
by PCR or through the extensive characterization of the 
genomic V H 4 repertoire of the tonsil donor (Table 1). Tern- 
plating of these events from any other V H gene family can 
also be ruled out as members of the seven human Vh gene 
families differ significantly in the CDR sequences where 
the events described had occurred. 

Strtictural and Functional Considerations of Insertions and De- 
letions into V H Cenes. The events involving the insertion 
or deletion of a single amino acid from the CDR1 or 
CDR2 would not be expected to profoundly alter the back- 
bone structure of these molecules, as the CDRs are the 
most malleable portions of antibodies. The clone g80 has 
two of the five amino acids that are customarily considered 
its CDR] deleted, leaving only three amino acids to form 
this hypervariable loop (Fig. 1 B). Thus, this is one of the 
shortest CDRls reported to date. The clone tml21 has 
two amino acids Inserted into the FW3 region. The por- 
tion of the FW3 where this insertion occurred is believed 
to be solvent exposed and corresponds to the region where 
the B cell superantigen staphylococcal protein A binds to 
most V H 3-encoded Ig molecules (28); therefore, it is likely 
that the insertion Into this V H 6 clone can be tolerated as a 
loop or bulge on the molecule's surface. The most complex 
structural change observed In our study involved clone 
pg86, with a six amino acid insertion at the FW1/CDR1 
junction that would presumably double the length of this 
hypervariable loop and require dramatic structural accom- 
modation. However, we were able to express this heavy 
chain and found it paired with light chain, Indicating that it 
is likely functional (Fig. 5). The clone HBp2, containing a 
triplet insert into its CDR1, is particularly interesting be- 
cause it has a known specificity. This V H 6 gene was Isolated 
from a human B cell hybridoma with anti- Bordetella pertus- 
sis specificity (21 , 22). Clone HBp2 has also been expressed 
in the baculovirus system and is fully functional. We are 
currendy performing mutational analysis of this heavy chain 
molecule to determine if the additional inserted amino acid 
plays a role in the affinity and/or specificity of this anti- 
body. 

Analysis of Insertions and Deletions Reported in the Litera- 
ture. Various groups have reported a number of insertion 
and deletion events (Table 3). Virtually all of the insertions 
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PwJymttu* Slippage «* 
Stutenng Cmitd Accnuni 

DaughltrStrawd "Slippy" 
ofi'of Tomplm Sumd 



Loop tnMrmadUto 

Ucfc of DNA R*p*ir 
CoftlaMUlkm of 



Deletion Evtuts: 



t 

^2- 



AGT IAC IAC TGG »■ 0™!.- (T.»J«.) — AGC ACC TXC IAC AAC 

AGT IAC IAC IAC TGG — unKomu) — AGC ACC TAC AAC 

Figure 6. Proposed mechanism causing insertion/deletion events: poly- 
merase slippage. This Figure Is based on model a of Streisinger ei al. (30) 
and Ripley (31). The same model can account for both (A) insertions and 
(B) deletions. 



and deletions reported from somatically mutated V genes 
involved the untranslated regions or occurred in silent pas- 
senger transgenes. ] 9 out of 25 insertions or deletions Into 
somatically mutated genes involved predominantly repeti- 
tive elements, or in several cases other sequence patterns as- 
sociated with secondary structures such as internal homolo- 
gies or inverted repeats (Table 3). With the inclusion of the 
9 events described in this work. 28 out of 34 insertions and 
deletions involved such elements. Thus, the proximity of 
sequence elements that can be predicted to cause secondary 
structural changes in the DNA seems to be a hallmark of 
insertions and deletions into somatically mutated Vh genes. 

A Mode! for the Occurrence of Insertions and Deletions during 
Somatic Hypermutation. The evidence for the involvement 
of DNA secondary structure in the production of insertion 
or deletion mutations during somatic hypermutation. as sug- 
gested in 1986 by Colding et al. (29), now seems unequiv- 
ocal. The insertions and deletions described in our study, 
and those illustrated In Table 3, occur in a predictable fash- 
ion, involving sequence motifs that could form loop inter- 
mediates reminiscent of the replication slippage model of 
Streisinger et al. (30) and Ripley and Clickman (for review 
see 3!) as presented in Fig. 6. Such mutations are postu- 
lated to occur when DNA polymerase slips or stutters and 
the newly synthesized strand shifts on the template and re- 
anneals to an adjacent repetitive element, producing un- 
paired loop intermediates localized to one or the other 
si rands. If this unpaired loop intermediate is not repaired 
then it will be perpetuated as an insertion of an instance of 
the repetitive element if in the daughter strand, or a dele- 
tion if in the template strand. 

A Possible Correlation to Intrinsic Hotspots. A higher fre- 
quency of somatic hypemiutation has been reported to oc- 
cur at sequence motifs referred to as intrinsic hotspots (for 
review see reference 12). Interestingly, every insertion/de- 
letion event reported in our study resembled one of these 
hotspots (AGC, TAC. and AAC; references 1 2 and 27; Fig. 



4). The analysis of selected populations may have influ- 
enced this tendency because seven out of eight of these 
events occurred in the CDRs where it has been shown that 
hoLspol motifs are preferentially found (25, 26). Further- 
more, only a weak correlation to hotspots could be found 
for the previously reported insertions/deletions involving 
unsclected regions of V loci (Table 3). However, the single 
event found in this analysis that occurred outside of the 
CDRs in FW3 (clone tml21, Figs. 1 C and 4 A), also in- 
volved a tandem of possible hotspots (AAG. AAC). A more 
extensive and directed analysis Is required to fully address 
this issue. 

Implications for the Molecular Mechanism of Somatic Hyper- 
mutation. The instability of repetitive tracts during DNA 
replication is a hallmark of defects in postreplicative mis- 
match repair (33), and the locus-specific downregulatlon of 
DNA mismatch repair in response to UV irradiation has re- 
cently been reported Tor Immunoglobulin Vh genes In 
freshly sorted GC B cells (CD38TgD~) compared to man- 
tle zone B cells (CD38" IgD*; reference 34). In a recent 
study by Tran et al. (35). it was shown that tract instability 
of homonucleotide runs associated with mismatch repair 
defects occur more frequently in long than in short runs. 
These authors suggested that if loop Intermediates occur In 
long repetitive tracts (>8 bp for a homonucleotide run) 
they could involve a distal repetitive element out of reach 
of the polymerase proofreading activity and only be sub- 
jected to mismatch repair. However, for short repetitive 
tracts, as for the events reported in this analysis, loop inter- 
mediates can only occur proximal to the polymerase com- 
plex and are therefore subjected to both polymerase proof- 
reading and mismatch repair mechanisms. 

All 9 events in this analysis, and 19 out of 25 events from 
the literature (28 out of 34 Insertions and deletions re- 
ported), appeared to result from secondary structural inter- 
mediates. Loop Intermediates proximal to the polymerase 
complex during DNA polymerization should be repaired 
by the polymerase proofreading mechanisms immediately, 
or by the postreplicative DNA repair systems. This analysis 
suggest the following characteristics for the polymerization 
process during somatic hypermutation. (a) The polymerase 
interacts with the V locus in a particularly unstable or 
"loose" fashion, especially when hotspol motifs or elements 
capable of forming secondary structures are encountered, 
allowing bp substitutions in most instances, and Insertions 
or deletions via polymerase slippage at a much lower fre- 
quency: (6) it has limited proofreading capabilities; and (c) 
there is a downregulatlon of postreplicative mismatch re- 
pair. An efficient means to downregulate mismatch repair 
during somatic hypermutation could be through the lack of 
differentiation of the template and progeny strands for the 
mismatch repair system; lack of strand differentiation has 
been shown to Increase the rate of mutations introduced 
(36). Such a system would be advantageous for the locus- 
specific V gene somatic hypermutation in that it could In- 
volve alterations of a single enzymatic complex (polymerase 
complex) rather than multiple systems (proofreading and 
mismatch repair). Another system, which would have the 
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same advantage, i.e., the alteration of a single complex, 
would be the alteration of a DNA repair system such as 
transcription-coupled repair to be the somatic mutator, as 
suggested in recent studies (13). Alternatively, the inser- 
tions and deletions might result solely from a downregula- 
tion of poslreplicative mismatch repair at the V locus in the 
rapidly proliferating centroblasts that are undergoing so- 
matic hypermutatlon or due to a polymerase enzyme with 
such a high fault rate as to overwhelm any repair. 

All currently accepted models of somatic hypermutation. 
whether related to DNA excision-repair-like systems or 
transcription-repair, or to DNA polymerization or reverse 
transcription, involve transcriptional activation involving 
cis-factors in the V locus (enhancers, etc.) followed by the 
activity of unknown polymerase enzymes of some type. 
This analysis does not refute or corroborate any of these 
models directly, but it does provide further characterization 
of the polymerization system involved, based on the types 



of mutations observed and on the molecular biology that is 
known to cause such mutations. This analysis and the 
model presented here provide further Information or crite- 
ria to be contemplated as the various possible polymerase 
systems involved are considered. 

Conclusions. Insertions and deletions into immunoglob- 
ulin V H genes during somatic hypermutatlon are additional 
means by which the Immunoglobulin repertoire can be di- 
versified. These events display characteristics supporting mod- 
els of somatic hypermutation involving a particularly unsta- 
ble or error-prone polymerase to allow the introduction of 
mutations, and involving the downregulation of DNA repair 
to allow the perpetuation of these mutations. Additionally, 
we show that these events tend to Involve sequence motifs 
resembling intrinsic hotspots of somatic hypermutation, sug- 
gesting that the polymerase complex is destabilized in a se- 
quence-specific manner to allow preferential mutation at these 
sequence elements. 



Wc are grateful to Yucheng Li. Fang Zhao. Steve Scholl, Carol Williams, Shirley Hal), and Robin Wray, all 
of whom provided unprecedented technical assistance for various aspects of this work. We also thank Kim- 
ble Frazer for excellent discussions. 

These studies were supported by a grant from the National Institutes of Health (AI- 12127). J, Donald Capra 
holds the Edwin L. Cox Distinguished Chair in Immunology and Genetics. 

Address correspondence to Dr. Virginia Pascual, Molecular Immunology Center, Department of Microbiol- 
ogy. UT Southwestern Medical Center. 6000 Harry Hlnes Blvd.. Dallas. Texas 75235-9140. Phone: 214- 
648-1918: Fax: 214-648-1915: E-mall: vpascu@mednei.swmed.edu 

Received for publication 14 July 1997 and in revised form 1 October 1997. 



References 

1. Clark, E.A., and J. A. Ledbetter. 1994. How B and T cells talk 
to each other. Nature. 367:425-428. 

2. Dorken, B., P. Moller, A. Pezzulto. R. Schwartz-Albiez. and 
C. Moidenhauer. 1989. B-cell antigens. In Leukocyte Typing 
IV. A.J. McMichael, editor. Oxford University Press. Lon- 
don, pp. 131-140. 

3. Ling. N.R., l.C.M. MacLennan. and D.Y. Mason. 1987. B-cell 
and plasma cell antigens: new and previously defined clusters. 
In Leukocyte Typing HI, A.J. McMichael, editor, Oxford Uni- 
versity Press, London, pp. 302-308. 

4. Pascual. V., Y.J. Liu. A. Magalskl. O de Boutciller. J. 
Banchereau, and J.D. Capra. 1994. Analysis of somatic muta- 
tion In five B cell subsets of human tonsil. / Exp. Med. 180: 
329-339. 

5. Liu, Y.J., O. de Bouieiller, C. Arpin, F. Briere. L. Calibert, 
S. Ho. H. Martinez-Valdez, J. Banchereau. and S. Lebecque. 
1996. Normal human IgDMgM" germinal center B cells can 
express up to 80 mutations in the variable region of their IgD 
transcripts. Immunity. 4:603-613. 

6 Denepoux, S , D. Razanajaona, D. Blanchard, C. MefTre. 
J.D. Capra. J. Banchereau, and S. Lebecque. 1997. Induction 
of somatic mutation in a human B cell line in vitro. Immunity. 
6:35-46. 

7. Berek. C. 1993. Somatic mutation and memory. Cure. Opin. 
Immunol. 5:218-222. 

8. Kelsoe. G. 1996. Life and death in germinal centers (redux). 
Immunity. 4:107-111. 



9. Liu, Y.J., CD. Johnson. J. Gordon, and l.C.M. MacLennan. 
1992. Ccrminal centres In T-cell-dependent antibody re- 
sponses. Immunol. Today. 13:17-21. 

10. MacLennan. l.C.M.. Y.J. Liu, and CD. Johnson. 1992. Mat- 
uration and dispersal of B-cell clones during T cell-depen- 
dent antibody responses. Immunol. Rev. 126:143-161. 

11. McKean, D., K. Huppi. M. Bell. L. Staudt. W. Gerhard, and 
M. Weigert. 1984. Generation of antibody diversity In the 
immune response of BALB/c mice to influenza virus hemag- 
glutinin. Proc. Natl. Acad. Sci. USA. 81:3180-3186. 

12. Neuberger. M.S.. and C. Milslcln. 1995. Somatic hypermu- 
tation. Curt. Opin. Immunol. 7:248-254. 

13. Storb. U. 1996. The molecular basis of somatic hypermuta- 
tion of immunoglobulin genes. Curr. Opin. Immunol. 8:206-214. 

14. Liu. Y.J., O. de BoutelfJer. C. Arpin. I. Durand, and J. 
Banchereau. 1994. Five human mature B cell subsets. Adv. 
Exp. Med. Biol. 355:289-296. 

15. Marks. J.D.. M. Trlslem. A. Karpas. and G. Winter. 1991. 
Oligonucleotide primers for polymerase chain reaction am- 
plification of human immunoglobulin variable genes and de- 
sign of family-specific oligonucleotide probes. Eur. J. Immu- 
nol. 21:985-991. 

16. Sanz, I., P. Kelly, C. Williams. S. Scholl. P. Tucker, and J.D. 
Capra. 1989. The smaller human V H gene families display re- 
markably little polymorphism. EMBO (Eur. Mol. BM Organ.) J. 
8:3741-3448. 

17. Potter. K.N., Y.C. Li. and J.D. Capra. 1994. The cross-reac- 



69 Wilson et al. 



m m 



live idiotopes recognized by the monoclonal aniibodies 9G4 
and LC I are located in framework region ] of iwo non-ovcr- 
lapplng subsets of human V H 4 family encoded aniibodies. 
Srand. J. Immunol. 40:43-49. 

18. Tomlinsori, I.M.. G. Waller. J.D. Marks. fvl.B. Llewelyn, and 
C. Winter. 3992. The repertoire of human germline V H se- 
quences reveals about fifty groups of V H segments with differ- 
ent hypervariable loops. / Mol. Biol. 227:776-798. 

19 Uniar, A., and P.J. Gearhart. 1995. Reciprocal homologous 
recombination In or near antibody VDJ genes. Eur. j. Immu- 
nol. 25:2392-2400. 

20. Ford, J.E.. M.G. McHeyzer-Williams. and M R. Lleber. 1994 
Chimeric molecules created by gene amplification Interfere 
with the analysis of somatic hypermuiaiion of murine immu- 
noglobulin genes. Gene. 142 279-283. 

21. Andris. J.S.. B R. Brodeur. and j.D. Capra. 1993. Molecular 
characterization of human antibodies to bacterial antigens: 
utilization of the less frequently expressed V H 2 and V H 6 
heavy chain variable region gene families. Mol. Immunol. 30: 
1601-1616 

22. Brodeur. B.R.. J. Hamel. D. Martin, and P. Rondeau. 199] 
Biological activity of a human monoclonal antibody to Boidn- 
tella pertussis lipooligosaccharide. Hum. Antib. Hybrid. 2:194-199. 

23. Betz, A.C.. C. Rada. R. Pannell. C. Milstein. and M.S. Ncu- 
berger. 1993. Passenger transgenes reveal intrinsic specificity 
of the antibody hypermutation mechanism: clustering, polar- 
ity, and specific hot spots. Proc. Nail. Acad. Sci. USA. 90: 
2385-2388. 

24. Rogoziti. I.B., and N.A Kolchanov. 1992. Somatic hyper- 
mutagenesls in immunoglobulin genes. II. Influence of neigh- 
bouring base sequences on mutagenesis. Blochim. Bioptiys. 
Arts. 1171:11-18. 

25. Goyenechea, B., and C. Milstein. 1996. Modifying the se- 
quence of an immunoglobulin V-geno alters the resulting pat- 
tern of hypermutation. Proc. Nail. Acad. Sd. USA. 93:13979- 
13934. 

26. Wagner, S.D.. C. Milstein, and M.S. Neuberger. 1995. 
Codon bias targets mutation. Nature. 376:732-733. 

27. Smith. D.S., G. Creadon. P.K. Jena. J.P. Portanova. B L. 
Kotzin, and L.J. Wysocki. 1996. Dl- and trinucleotide target 
preferences of somatic mutagenesis in normal and autoreac- 
tive B cells. J. Immunol. 156:2642-2652. 

28. Potter. K.N., Y. Ll, and J.D. Capra. 1996. Staphylococcal 
protein A simultaneously interacts with framework region 1. 
complementarity-determining region 2, and framework re- 
gion 3 on human V H 3-encoded Igs. / Immunol. 157:2982- 
2988. 

29. Colding. G.B., P.J. Gearhart. and B.W. Glickman. 1987. Pat- 
terns of somatic mutations in immunoglobulin variable genes. 
Genetics. 115:169-176. 

30. Strt'lsinger, C., Y. Okada, J. Emrich. J Newton, A. Tsugita, 
E. Terzaghl. and M. Inouye. 1966. Frameshlft mutations and 
the genetic code. This paper is dedicated to Professor Theo- 
dosius Dobzhansky on the occasion of his 66ih birthday. Cold 
Spring Harbor. Symp. Quant. Biol. 31:77-84. 

31. Ripley. LS. 1990. Frameshift mutation: determinants of 
specificity. Annu. Rev. Genet. 24:189-213. 

32. Modrich. P., and R. Lahue. 1996. Mismatch repair in repli- 
cation fidelity, genetic recombination, and cancer biology. 
Annu. Rev. Blochem. 65:101-133. 

33. Strand. M.. T.A. Prolla, R.M. Liskay, and T.D. Petes. 1993. 
Destabiltzation of tracts of simple repetitive DNA In yeast by 
mutations affecting DNA mismatch repair. Nature. 365: 



274-276. 

34. Fairhurst. R.M., Y. Valles-Ayoub, M. Neshat. and J. Braun. 
1996. A DNA repair abnormality specific for rearranged im- 
munoglobulin variable genes In germinal center B cells. Mol. 
Immunol. 33:231-244. 

35. Tran, H.T.. J.D. Keen, M. Kricker, M.A. Resnick, and D A. 
Gordenln. 1997. Hypermutability of hotiionucleollde runs In 
mismatch repair and DNA polymerase proofreading yeast 
mutants. Mol. Cell Biol. 17:2859-2865. 

36. MacPhee, D.C. 1996. Mismatch repair as a source of muta- 
tions in non-dividing cells. Genetlca. fflie Hague). 97:183-195. 

37. Matsuda, F.. and T. Honjo. 1996. Organization of the human 
immunoglobulin heavy-chain locus. Adv. Immunol. 62:1-29. 

38. Dunn-Walters. D.K., P.C. Isaacson, and J. Spencer. 1997. 
Sequence analysis of human lgV H genes Indicates that Ileal 
lamina propria plasma cells are derived from Peyer's patches. 
Eur. J. Immunol. 27:463-467. 

39. Both. C.W.. L. Taylor. J.W. Pollard, and E.J. Steele. 1990. 
Distribution of mutations around rearranged heavy-chain an- 
tibody variable-region genes. Mol. Cell Biol. 10:5187-5196. 

40. Allen. D., T. Simon. F. Sablitzky. K. Rajewsky. and A. Cu- 
mano. 1988. Antibody engineering for the analysis of affinity 
maturation of an anti-hapten. EMBO (Eur. Mol. Biol. Organ.) 
J. 7:1995-2001. 

41. Kim, S„ M. Davis. E. Sinn, P. Patten, and L. Hood. 1981. 
Antibody diversity: somatic hypermuiaiion of rearranged V H 
genes. Cell. 27:573-581. 

42. Lebecque. S C., and P.J. Gearhart. 1990. Boundaries of so- 
matic mutation In rearranged Immunoglobulin genes: 5' 
boundary is near the promoter, and 3' boundary is approxi- 
mately 1 kb rrom V(D)J gene. /. Exp. Med. 172:1717-1727. 

43. Kataoka. T., T. Nikaldo, T. Mlyata, K. Moriwakl, and T. 
Honjo. 1982. The nucleotide sequences of rearranged and 
germline Immunoglobulin V H genes of a mouse myeloma 
MC101 and evolution of V M genes in mouse. / Biol. Chem. 
257:277-285. 

44. Gearhart, P.J., and D.F. Bogenhagen. 1983. Clusters or point 
mutations are found exclusively around rearranged antibody 
variable genes. Proc. Natl. Acad. Scl. USA. 80:3439-3443. 

45. Rickert. R. and S. Clarke. 1993. Low frequencies of somatic 
mutation In two expressed V kappa genes: unequal distribu- 
tion of mutation In 5' and 3' flanking regions. Int. Immunol. 
5:255-263. 

46. Weber. J.S.. J. Berry, S. Lltwln, and J.L. Claflln. 1991. So- 
matic hypermutation of the JC intron is markedly reduced In 
unrearranged kappa and H alleles and is unevenly distributed 
In rearranged alleles. / Immunol. 146:3218-3226. 

47. Rogerson, B., J. Hackett. Jr., A. Peters, D. Haasch, and U. 
Storb. 1991. Mutation pattern of Immunoglobulin transgenes 
Is compatible with a model of somatic hypermutation in 
which targeting of the mutator is linked to the direction of 
DNA replication. EMBO (Eur. Mol. Biol. Organ.) J. 10:4331- 
4341. 

48. Kosmas, C, N.A. Viniou. K. Stamatopoulos, N.S. Courte- 
nay-Luck, T. Papadaki. P. Kollia, G. Paterakis. D. Anagnos- 
tou, X. Yataganas. and D. Loukopoulos. 1996. Analysis of 
the kappa light chain variable region in multiple myeloma. 
Br. J. Haematol. 94:306-317. 

49. Wu, H.Y., and M. Kaartinen. 1995. The somatic hypermuta- 
tion activity of a follicular lymphoma links to large insertions 
and deletions of immunoglobulin genes. Stand. J. Immunol. 
42:52-59. 



70 Somatic Hypermutation: Insertions and Deletions 



Vol. 277, No. 47, Issue of November 22, pp. 45108-45114, 2002 
Prinud in U.S.A. 

Functional Consequences of Insertions and Deletions in the 
Complementarity-determining Regions of Human Antibodies* 

Received for publication, August 16, 2002, and in revised form, September 16, 2002 
Published, JBC Papers in Press, September 16, 2002, DOI 10.1074Vjbc.M208401200 

Johan Lantto and Mats Ohlini 

From the Department of Immunatechnology. Lund University, P.O. Box 7031, S-220 07 Lund, Sweden 



The Journal of Biological Chemistry 

O 2002 by The American Society for Biochemistry and Molecular Biology. Inc. 



Insertions and deletions of nucleotides in the genes 
encoding the variable domains of antibodies are natural 
components of the hypermutation process, which may 
expand the available repertoire of hypervariable loop 
lengths and conformations. Although insertion of amino 
acids has also been utilized in antibody engineering, 
little is known about the functional consequences of 
such modifications. To investigate this further, we have 
introduced single-codon insertions and deletions as well 
us more complex modifications in the complementarity- 
determining regions of human antibody fragments with 
different specificities. Our results demonstrate that 
single amino acid insertions and deletions are generally 
well tolerated and permit production of stably folded 
proteins, often with retained antigen recognition, de- 
spite the fact that the thus modified loops carry amino 
acids that are disallowed at key residue positions in 
canonical loops of the corresponding length or are of a 
length not associated with a known canonical structure. 
We have thus shown that single-codon insertions and 
deletions can efficiently be utilized to expand structure 
and sequence space of the antigen-binding site beyond 
what is encoded by the germline gene repertoire. 



Antibodies are highly specific receptors of the immune sys- 
tem that also have a great potential as reagents in biological 
chemistry and as therapeutic agents. The part of the antibody 
that makes contact with the antigen is comprised of two vari- 
able (V) 1 domains, the heavy (H) and the light (L), which both 
are made up of a two-/3-sheet framework. From this framework, 
six complementarity-determining region (CDR) loops, three 
from the light domain and three from the heavy domain, pro- 
trude and make up the antigen-binding site CI, 2). Five of these 
CDR loops generally adopt only a limited number of backbone 
conformations, so-called canonical structures (reviewed in Ref. 
3), which are determined by the lengths of the loops and by the 
presence of specific key residues. The antigen specificity of the 
binding site is mainly determined by the sequence and confor- 
mation of these CDR loops. 

Antibody diversity is generated by the imprecise recombina- 
tion of two or three sets of germline gene segments and by the 
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combination of different heavy and light domains (4). The di- 
versity is further increased by the process of somatic hypermu- 
tation (5) and by receptor editing and revision (6). As the 
germline variable gene repertoire encodes a rather limited 
number of CDR loop lengths (IMGT, the international ImMu- 
noGeneTics data base, Ref. 7), the number of observed canon- 
ical structures is similarly limited. However, it was recently 
discovered that B celta evolve the genes encoding immunoglob- 
ulin V domains not only by nucleotide substitution but also 
through an additional mechanism of insertion and deletion of 
nucleotides during the hypermutation process (8-11). This 
mechanism has the potential to expand the available repertoire 
of loop lengths and conformations if the insertions and dele- 
tions involve entire codons and occur at positions in the 
sequence that can tolerate such modifications. A number of 
examples of seemingly functional insertions and deletions in 
the CDR of both the heavy and light domains of human anti- 
bodies have in fact been encountered lately (Refs. 8 and 12 and 
references therein). Furthermore, we have recently discovered 
that human 1GHV 2 germline genes carry features in CDR1 and 
CDR2 that make these regions particularly prone to deletions 
of entire codons (12). 

The occurrence of insertions and deletions in antibody V 
genes is not only of fundamental interest but is also of biotech- 
nological importance. It has been known for some time that the 
topography of the antigen-binding site is related to the size of 
the antigen (13-15). Three different types of binding sites have 
been described; cavity, groove, and planar, which roughly cor- 
respond to hapten, peptide, and protein, respectively. This re- 
lationship has been further investigated by Vargas-Madrazo et 
al. (16), who have described a correlation between the length of 
the CDR loops and the antigen recognized. According to these 
findings, cleft-like binding sites that recognize small molecules 
are created by long loops (especially the CDRH2 and LI loops), 
whereas planar-binding sites that are specific for large mole- 
cules are formed by short loops. In other words, by modifying 
the loop lengths of an antibody-binding site, it may thus be 
possible to design antibodies optimally suited for recognition of 
a particular class of antigen. Lamminmfiki et al. (17) have in 
fact used this approach to modify a murine antibody specific for 
17B-estradiol. They introduced additional residues into CDR2 
of the heavy domain and were able to improve the recognition 
of the antigen. This improvement was suggested to be the 
result of a deeper binding site, created through the extension of 
CDRH2, which better accommodated the hapten (17). 

Despite the establishment of insertions and deletions as 
naturally occurring modifications of antibody sequences and 
the use of amino acid insertions for antibody engineering, little 
is still known about the functional consequences of such mod- 
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ideations. We have therefore created single-codon insertions 
and deletions as well as more complex modifications in the 
CDR of two human antibody single chain V region fragments 
(scFv) specific for a peptide and a hapten, respectively, and 
investigated the effects on antigen recognition, thermal stabil- 
ity, and protein folding. Our results demonstrate that single 
amino acid insertions in both CDRH1 and H2 and deletions in 
CDRH2 are usually well tolerated and permit production of 
folded proteins despite the fact that the modified loops carry 
amino acids that are disallowed at key residue positions in 
canonical loops of the corresponding length or do not take on a 
characteristic length of a known canonical structure. Modifica- 
tions of this kind are in other words an efficient mode of 
expanding antibody sequence and structure space beyond what 
is encoded by the germline gene repertoire, which may enable 
targeting of novel or otherwise poorly immunogenic antigens. 

EXPERIMENTAL PROCEDURES 

Antibody Frameworks — The frameworks encoding the anti-cytomeg- 
alovirus scFv AE11F and the anti-fluorescein isothiocyanate (FiTC) 
scFv FITC8 have been described elsewhere (18-20). The cloning and 
production of the AEUF and AE11F/3-20L1 scFv in Pic-Ala pasloris 
have also been described 121). 

Creation of Insertion and Deletion Variants — Mini-libraries uf scFv 
genes carrying codon insertions at various positions were created by the 
use of overlap extension PCRwith degenerate primers that introduced 
NNK codons. Variants with a deletion were similarly created with 
primers lacking one codon. The AEllF-based variants carrying CDR11 1 
sequences derived from the IGHV4 subgroup were created using the 
CDR-shuffling technique (22) essentially as described previously 
(21, 23). 

Production and Purification of scFu Variants— The F1TC8 scFv and 
all variant scFv were cloned into the pPICZa vector (Invitrogen) with 
C-terminal FLAG sequences (24) and produced in P. pastoris as de- 
scribed previously (21). The mini-libraries encoding AE11F and FITC8 
variants were screened for scFv production or antigen binding accord- 
ing to the colony lift assay by McGrcw et al. (25). Briefly, transformed 
P. pastoris colonies were lifted onto cellulose acetate filters (Pall 
Gelman Sciences, Ann Arbor, MI) and were grown on top of nitrocellu- 
lose filters, which were placed on methanol-containing plates. After 
48 h of induction, scFv bound to the nitrocellulose filters were detected 
by a combination of anti-FLAG M2 antibody (Sigma) and rabbit anti- 
mouse Ig/horseradish peroxidase conjugate (DAKO A/S, Glostrup. 
Denmark) or FITC-biotin (Sigma) and streptavidin/horseradish perox- 
idase conjugate (DAKO A/S) using the ECL Plus™ Western blotting 
detection reagents (Amersham Biosciences) according to the manufac- 
turer's recommendations. Single colonies were also picked and grown in 
liquid cultures to enable further characterization orthe antigen binding 
properties (see below). In addition, a number of scFv variants were 
produced at a larger scale and purified as monomers. The AEllF-bascd 
variants were purified essentially as described previously (21 ), whereas 
the FlTC8-based variants were purified by affinity chromatography on 
a Sepharose resin with FITC-conjugated bovine serum albumin (BSA) 
(kindly provided by Dr. B. Jansson, Biolnvent Therapeutic AB, Lund, 
Sweden) followed by gel filtration as before. 

Analysis of Antigen Recognition—The reactivity of the scFv variants 
with different antigens, both as crude expression supernatants and as 
purified monomers, was analyzed by enzyme-linked immunosorbent 
assay (ELISA) and by using the BIAcore technology (BIAcorc AB, 
Uppsala, Sweden). The AEJIF-based clones were tested on BSA, 
ovalbumin, streptavidin, and a biotinylated peptide that mimics the 
viral epitope (21) bound via streptavidin and the FITC8-baeed clones on 
BSA, streptavidin, FITC-BSA, FITC-biotin (bound via streptavidin), 
and a number of irrelevant BSA-coupled haptens obtained from Sigma 
or Biosearch Technologies Inc. (Novate, CA>. The ELISA was performed 
according to standard protocols with anti-FLAG M2 (Sigma) and rabbit 
anti-mouse immunoglobulin/horseradish peroxidase conjugate (DAKO) 
to detect bound scFv. The BIAcore measurements and the calculation of 
the reaction rate kinetics were performed essentially as described 
previously (21). 

Differential Scanning Catorimetry <DSC) — DSC measurements were 
performed using a VP-DSC from Microcal Inc. (Northampton, MA) in 
the temperature range 20-90 °C at a heating rate of 607h. All meas- 
urements were performed in phosphate-buffered saline (PBS), pH 7.4, 
containing 0.02% sodium azide at protein concentrations between 0.1 



and 0.2 mg/ml with PBS in the reference cell. Prior to protein versus 
PBS measurements, PBS versus PBS scans were performed. 

CD Spectroscopy — CD spectra were recorded on a J-720 spectropola- 
rimeter (Jasco Inc., Easton. MD) in a 2-mm cuvette at a protein con- 
centration of 0.1 mg/ml in 50 mM sodium phosphate, pH 7.4. Each 
sample was scanned two to eight times from 250 to 200 nm at a scan 
speed of 10 nm/min, a resolution of 1 nm, a bandwidth of 1 nm, and a 
sensitivity of 20 millidegrees, and the scans were combined to produce 
the final spectrum. Data are presented as mean residue molar elliptic- 
ity, which was calculated using the mean residue weight of each scFv. 

Sequencing and Canonical Structure Classification — The nucleotide 
sequences of the variant scFv clones were determined by automated 
DNA sequencing as described elsewhere (26) after isolation of the 
templates by direct PCR on P. pastoris colonies using vector-specific 
primers. In the case of the CDRIIl-grafted clones, the origin of the CDR 
was determined using the IMGT/V-QUEST alignment tool at IMGT, the 
international ImMunoGeneTics data base (imgt.cines.fr and Ref. 7). All 
sequences were defined and numbered in accordance with the IMGT 
nomenclature and unique numbering (7). Complete sequences of the 
variant scFv from this study can be found in GenBank™ under acces- 
sion codes AF543317-AF543349. The canonical structure classification 
was performed using the software implemented on the Antibodies - 
Structure and Sequence server (www.bioinf.org.uk/nbs/chothia.html 
and Ref. 27). 

RESULTS 

The scFv Frameworks — The parent antibody frameworks 
used in this study are both of human origin although there are 
differences in the way they were obtained. The AEUF scFv 
was derived from a monoclonal antibody isolated from a cyto- 
megalovirus-seropositive blood donor (18, 19). It originates 
from the IGHV3-30 and IGKV3-11 genes, which both have 
acquired a number of mutations (21). This scFv recognizes both 
intact glycoprotein B from cytomegalovirus and peptides mim- 
icking the AD-2 epitope (21, 28). The hapten (FITC)-specific 
scFv FITC8 was derived from a synthetic scFv library, which 
had been constructed by shuffling of human CDR sequences 
into a single framework consisting of the human IGHV3-23 
and 1GLV1-47 genes (20). The CDR sequences utilized by this 
scFv originate from IGHV3-7 and IGHV3-23 in the case of 
CDRH1 and CDRH2, IGLV1-40 and IGLV1-40 or 1GLV1-50 
in the case of CDRL1 and CDRL2, and IGLV1-47 in the case of 
CDRL3. Except for the CDRL1 loop, which is one residue longer 
than the IGLV1-47 germline length, the CDR loops of the 
F1TC8 scFv are of the same length as the loops normally 
encoded by the framework genes. As the structures of the 
two scFv have not been determined, the loop structures are 
unknown. However, by analyzing the deduced amino acid 
sequences using the tools at the Antibodies - Structure and 
Sequence server (27), the most similar of the observed canon- 
ical classes were identified (Table 1). 

Single-codon Insertions and Deletions— To determine the 
capability of the two antibody frameworks to tolerate length 
modifications in the CDR loops, we made single-codon inser- 
tions in CDRH1 and CDRH2 and a single-codon deletion in 
CDRH2. The modifications involved insertions after positions 
31-33 in CDRH1, insertions after positions 57 and 58 in 
CDRH2, and a deletion at position 58 in CDRH2 (Fig. 1). All 
modifications were introduced at positions corresponding to the 
apices of the loops, i.e. the positions where the natural length 
variation occurs (31). A study of the IGHV germline gene rep- 
ertoire has shown that these parts of the CDR carry repetitive 
sequence tracts, which naturally target them with deletions 
(and possibly also insertions) during the hypermutational proc- 
ess (12). Residues in these regions have al30 been shown to 
frequently make contact with the antigen in known antibody- 
antigen complexes (15), suggesting that modifications at the 
above mentioned positions will result in an expansion of struc- 
ture space that i3 relevant for antigen recognition. 

Libraries of scFv clones producing different insertion vari- 



45110 



Insertions and Deletions in Antibody Hypervariable Loops 



Table 1 

Examples of various single-codon modifications in scFv dories based on the AEIIF and FITCS frameworks, the canonical class 
belonging of the CDH loops, reactivity of the scFv with the original antigens as determined by E LISA or by BIAcore measurements, 
and the unfolding temperature of selected clones as determined by DSC 
Modification refers to the nature of the changes in loop length; Ins indicates insertion, and Del indicates deletion. Numbering is according to the 
IMGT unique numbering (7). Canonical class indicates the combination of canonical structures of CDRHl, H2, and LI as determined by automatic 
canonical structure classification (27). The altered canonical structure is indicated in bold. Antigen recognition: -, negative; ±, weakly positive; 
+, positive; + +, strongly positive. 



scrv clone 


Modification 


LanonicaJ class 


Antigen 
recognition 


Unfolding 
temperature 










°C 


AE11F 


Original sequence 


1-3-2 


+ + 


62.4 


ASV18 


Ins Pro-31A 


2-3-2 


+ 




ASV19 


Ins Asn-31A 


2-3-2 


+ 




ASV43 


Ins Arg-31A 


2-3-2 


+ 




ASV15 


Ina Hia-32A 


2-3-2 


+ 




ASV37 


Ins Ile-32A 


2-3-2 






ASV39 


Ina Phe-32A 


2-3-2 


+ 




ASV02 


Ina Phe-33A 


2-3-2 


* 




ASV35 


Ina Aan-33A 


2-3-2 






ASV07 


Ina Lys-57A 


1-U--2 


+ + 


63.0 


ASV08 


Ina Iie-57A 


l-U-2 


+ 




ASV28 


Ina Thr-57A 


l-U-2 


+ + 




ASV05 


Ins Glu-58A 


l-U-2 


+ + 




ASV10 


Del Val-58 


1-1-2 




61.9 


AEUF/3-20L1 


Ina CDRL1* 


1-3-6" 
HI-H2 




62.3 


FITC8 


Original sequence 


1-3 


+ + 


63.4 


FSV71 


Ina Scr-31A 


2-3 


+ 




FSV73 


Ina His-31A 


2-3 


* 




FSV76 


Ins Arg-31A 


2-3 






FSV81 


Ins Asn-32A 


2-3 


+ 




FSV84 


Ins Pro-32A 


2-3 


+ + 


61.7 


FSV85 


Ina Arg-32A 


2-3 


+ 




FSV91 


Ina Leu-33A 


2-3 


+ 




FSV93 


Ins His-33A 


2-3 


+ 




FSV96 


Ins Tyr-33A 


2-3 


+ 




FSV51 


Ina Ser-57A 


1-U 


+ + 




FSV52 


Ina Ala-57A 


1-U 


+ + 




FSV56 


Ins Leu-57A 


1-U 


+ + 




FSV43 


Ina Thr-58A 


1-U 


+ + 


61.1 


FSV46 


Ins Arg-58A 


1-U 






FSV61 


Del Gly-58 


1-1 


+ 


63.9 



" U indicates that the canonical structure of the created loop length ia currently unknown. 
" See text for detaila regarding the insertion. 

" The automatic canonical class algorithms failed to unambiguously predict a structure for the CDRL1 loop of this scFv. Similarities in length 
and sequence with Fab 117 (PDB entry lfig) suggest that the loop belongs to canonical structure class 6 (30). 



ants were screened directly by the use of a colony lift assay (25). 
This analysis showed that —95% of the clones based on the 
FITCS framework had retained their specificity for FITC (data 
not shown). The libraries bused on the AEllF framework were 
screened for the production of FLAG-carrying proteins, and a 
similar ratio of clones positive for scFv production was obtained 
(data not shown). Both positive and negative clones from each 
library were sequenced to determine the nature of the modifi- 
cations, and the analysis showed that a wide range of amino 
acids was inserted at the intended positions. To determine the 
effect of these length modifications on the structure of the 
targeted loops, the most similar canonical structures were 
identified by the automatic canonical structure classification 
(27). A number of examples from each insertion library and the 
deletion variants are presented in Table I. 

As the AEllF-based libraries were only tested for the pro- 
duction of FLAG-tagged proteins, they had to be characterized 
further to determine whether the scFv were functionally 
folded. This was done by analyzing the antigen-binding prop- 
erties of the modified clones. Although changes in loop struc- 
ture may be associated with a loss of antigen recognition, 
specific recognition of an antigen will confirm that the polypep- 
tide chain is correctly folded as this is a requirement for it to 
function as a framework for the antigen-binding site. Analysis 
of expression aupernatants of randomly picked clones (includ- 
ing the deletion variants) by ELISA or by using the BIAcore 



technology confirmed the above finding that the majority of the 
FITC8-based clones recognized the original antigen. Impor- 
tantly, this analysis showed that most of the AEllF-based 
clones had also retained their specificity for the original viral 
antigen (Table 1). Furthermore, when tested for binding to a 
number of irrelevant antigens (see "Experimental Proce- 
dures"), none of the clones displayed any cross-reactivity (data 
not shown), demonstrating that the modified scFv clones re- 
tained a high degree of specificity for the original antigens and 
therefore likely also assumed a correct immunoglobulin fold. 

A number of clones of each specificity, chosen to exemplify 
the different modifications, were produced at a large scale to 
study the interaction with the original antigens in detail and 
determine the stability of the purified proteins. BIAcore meas- 
urements with the purified monomers of the ASV07, ASV10, 
ASV35, FSV43, FSV61, and FSV84 clones confirmed the pre- 
viously obtained results with crude expression supematants 
(Table I and Fig. 2). Furthermore, evaluation of the reaction 
rate kinetics with the original antigen showed that the modi- 
fications did not affect the dissociation rates of the FITC8- 
based clones to any greater extent (Fig. 2fi). The thermal 
stability of the purified monomers was determined by DSC, and 
all tested clones displayed unfolding temperatures very similar 
to the parent scFv (Table I), further verifying that the IGHV3- 
derived antibody frameworks tolerate single-codon insertions 
and deletions in CDRHl and H2 very well. 
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using the WAM algorithm (29), a determined structure of the protein-specific antibody B7-15A2 (Protein Data Bank entry laqk), which originates 
from a highly related 1GHV gene and has a CDRH3 of the same length as AE11F, and a structure model of FITC8 (20). CDRH3 is Bhown in red, 
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Fic. 2. The single-codon modifications did not affect the reaction rate kinetics of the FITC8-based scFv variants to any greater 
extent. Representative BIAcore aensorgrame of AEllF-boscd scFv and FiTC8-based scFv analyzed on streptavidine bound viral peptide (A) or 
streptavidine-bound FITC (j9), respectively. Dissociation rate constants (kj) were calculated from multiple measurements and are presented as the 
mean value ± S.E. N.A., not applicable; RU, resonance units. 



As insertions and deletions have been demonstrated to occur 
naturally in both heavy and light domain V genes (8), we 
decided to extend this study and also evaluate the stability of a 
previously produced AE1 IP-based scFv variant with an inser- 
tion in CDRL1 (AE11F/3-20L1) (21). The modified CDRL1 of 
this scFv is identical, except for an additional serine residue, to 
the germline gene" from which AE1 IF originates. This clone has 
also been demonstrated to recognize both the epitope- 
mimicking peptide and intact, recombinant glycoprotein B, albeit 
with a lower affinity than the affinity matured AE11F scFv (21, 
32). The thermal stability of the AE11F/3-20L1 scFv was deter- 
mined as before after purification of monomelic scFv, and the 
unfolding temperature was found to be similar to that of the 



original scFv (Table I), thus indicating that not only heavy but 
also light domain CDR tolerate modifications of this nature well. 

Grafting of CDRH1 Loops from Distantly Related IGHV 
Genes — As all of the insertions and deletions described so far 
were introduced at the tips of the hypervariable loops, the parts 
of the immunoglobulin fold that best can be expected to accom- 
modate such modifications, we decided to introduce more ex- 
tensive modifications to investigate the effect of such changes 
of antibody sequence and structure. These modifications were 
introduced into and immediately adjacent to CDRH1 of the 
AE11F framework by the CDR-shuffling technique (22) using 
CDR sequences isolated from activated human B cells. Se- 
quences originating from the IGHV4 subgroup were chosen for 
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Table II 

Deduced amino acid sequences, germline gene origin, and canonical structure class belonging of the CDRHI loops of tlie AEUF scFu and the 

CDRH1 -grafted variants of this 

Amino acid sequences are aligned and numbered in accordance with the IMGT unique numbering (7) and gaps thereby introduced are indicated 
by dashes. Amino acids that are part of the CDR1-IMGT (7) are underlined. Dots indicate identity with the AE11F sequence. Canonical structures 
were determined by automatic canonical structure classification (27). 



scFv clone 


Amino acid sequence 


Germline origin 


Canonical 
structure 




2 3 33334 4 








2 0 45680 4 






AE11F 


SCAASGFIFSEYD — MHWVRO 


IGHV3-30 


1 


E3 


. . .V. .GS1.SGGYYWS.I. . 


IGHV4-3J 


3 


E6 


. . .V. .YSI.SGYY-WG.I. . 


lGHV4-b 


2 


E10 


. . .V. .GSI.S.Y-- WS.I. . 


IGHV4-S9 


1 


EH 


. . . V. .GSI .G.H— WS.I. . 


1GHV4-34 


1 


EH 


. . .V. .GSI.SGGYSWS.I. . 


IGHV4-30-2 


3 



the grafting as these are only distantly related to the [GHV3 
CDR and therefore allow for a higher degree of variability. In 
addition, genes from the IGHV4 subgroup encode loops of dif- 
ferent lengths than genes from the IGHV3 subgroup, including 
loops of the same length as the ones created by the single-codon 
insertions in CDRHI, thus enabling a comparison with these 
modifications. Sequencing of randomly picked clones showed 
that seemingly functional, i.e. in-frame and without stop 
codons, IGHV3 genes carrying IGHV4-derived CDRHI se- 
quences were obtained (Table II). However, when analyzing 
crude expression supernatants of the constructs, it was found 
that all of the clones had lost the original antigen specificity 
and instead acquired a polyreactive character (Fig. 3). 

To further investigate this polyreactive nature of the 
CDRHl-grafted clones, two of them, E3 and E6, were produced 
at a larger scale and purified as monomers to enable structural 
characterization. These two clones were chosen based on the 
presence of loop lengths different from the one used by the 
parent antibody (Table ID. As judged by analytical gel filtra- 
tion, these clones also gave rise to proteins that behaved as 
scFv monomers (data not shown). The overall secondary struc- 
ture was determined by CD spectroscopy and was compared 
with the results obtained with other monomeric scFv. As shown 
in Fig. 4, the spectra of both of the CDRHl-grafted clones 
displayed a strong negative signal near 200 nm, which is in- 
dicative of unordered polypeptides (33). For a comparison, the 
spectra of both the parent scFv and the FITC8 scFv displayed 
a weak negative signal near 217 nm, which is characteristic of 
the |3-sheet conformation of antibody domains (Fig. 4). The 
same result was also obtained with clones carrying single- 
codon modifications, such as the AEUF/3-20L1 and the FSV43, 
which gave rise to nearly identical spectra as the parent scFv 
(data not shown). When analyzed by DSC, no unfolding tem- 
peratures could be determined for either of the E3 or E6 scFv, 
suggesting that the proteins already were in an, at least partly, 
unfolded state. Thus, by inserting these only distantly related 
CDR sequences into the IGHV3 framework, the boundaries 
that define a stable immunoglobulin fold had apparently been 
exceeded. 

DISCUSSION 

Insertions and deletions of nucleotides have recently been 
shown to be an additional mechanism whereby immunoglobu- 
lin V region genes are evolved (8-11) and which may expand 
the available repertoire of antibody hypervariable loop lengths 
and structures. Although sequence modifications of this kind, 
especially insertions, have also been exploited in antibody en- 
gineering, knowledge about the effects of these modifications 
on protein stability and antigen recognition is still limited. 
Such factors are critical as they determine the success of this 
mode of molecular evolution, whether employed by nature or by 



the molecular engineer. To study the functional consequences 
of both insertions and deletions in the CDR of human antibod- 
ies, we have here made single-codon insertions and deletions as 
well as more extensive modifications in the CDR of two anti- 
body fragments with different specificities and assessed the 
thermal stability and the antigen binding properties of the 
resulting proteins. 

The single-codon modifications were well tolerated by the 
two scFv frameworks as determined by the thermal stability 
measurements and the high ratio of functional clones despite 
the fact that they created both loop lengths that do not occur 
normally within the human IGHV3 subgroup and combina- 
tions of loop lengths that do not exist in the human germline 
repertoire. Insertion of one residue in CDRH2 of the two scFv 
studied here creates a loop length (CDR2-IMGT length 9 amino 
acids) that is not naturally encoded by any IGHV genes except 
for the only member of the IGHV6 subgroup (7). This loop 
length has been predicted to have its own distinct conformation 
(canonical structure 5, Ref. 31), but as no immunoglobulin 
encoded by this gene has been structurally determined, this 
canonical structure has not been defined The insertion of one 
residue in CDRHI produces a loop length (CDR1-IMGT length 
9 amino acids) that occurs naturally within the human IGHV4, 
but not the IGHV3 subgroup, and which could correspond to 
canonical structure 2 as judged by the automatic canonical 
structure classification. This coexistence of canonical structure 
2 in CDRHI with canonical structure 3 in CDRH2 (Table I) 
does not occur naturally within the human IGHV germline 
repertoire, although it has been observed in hypermutated 
antibodies with insertions in CDRHI (8). In addition, the struc- 
ture classification also revealed that a large number of the key 
residue requirements for canonical structure 2 were not ful- 
filled (27), i.e. the thu3 modified CDRHI loops either take on 
structures not covered by the described canonical structures or 
adopt the observed structure corresponding to this loop length 
despite the presence of a large number of disallowed amino 
acids at key residue positions. Irrespective of the circum- 
stances, the insertions in CDRHI seem to, like the rest of the 
single-codon modifications, give rise to scFv that are correctly 
folded and stable. 

The fact that the loop lengths that were created by the 
single-codon insertions are not part of the IGHV3-encoded rep- 
ertoire does not mean that they are completely unnatural in 
the context of an IGHV3 framework. Apparently functional 
antibodies belonging to the IGHV3 subgroup with insertions in 
CDRHI and CDRH2 leading to CDR-IMGT loop lengths of 9 
amino acids have in fact been described by others (8, 34, 35). As 
the deletions at position 58 in CDRH2 of both scFv give rise to 
loop lengths that are used by other members of the IGHV3 
subgroup, it is not entirely unexpected that these modifications 
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Fig. 3. Clones carrying CDRH1 sequences from distantly related IGHV genes displayed a polyreactive character. Reactivity of the 
AE11FO. E3 (■), E6 CO). E10<A), Ell (♦). and E14 (•) scFv with strcptavidin-bound viral peptide (A), streptavidin (B), BSA iC>, ovalbumin 
ID), and uncoated polystyrene wells (£), as determined by ELISA. Relative concentrations of the expression supernatants were estimated by 
immunoblotting. The coefricient of variation was below 10% for the whole data set. 

polyreactivity is most likely due to a destabilized or inappro- 
priately folded V domain, as demonstrated by the CD spectra of 
two of the clones (Fig. 4). Destabilizing effects of loop grafting 
into an antibody framework have been reported previously 
(40), but in that particular case, the grafted sequences were 
totally unrelated to antibody hypervariable loops. The use of 
naturally occurring CDR sequences for grafting into immuno- 
globulin frameworks often ensures that the inserted loops are 
optimally functional as they have been proofread and selected 
for functionality during the formation of the B cell receptors. 
Our data show, however, that the functionality of the grafted 
loops also depends on the framework they are inserted into 
even if they are natural immunoglobulin sequences. The reason 
for the observed effects probably lies in the differences in cer- 
tain key residues between the 1GHV3 and 1GHV4 frameworks. 
In fact, many of the amino acids that differ between the origi- 
nal AE11F sequence and the grafted sequences are residues 
that are used to define the canonical structures (27, 31). In 
addition, Tramontano et al. (41) have shown that framework 
residue 80 of the heavy V domain packs against residues in 
both CDRH1 (position 30) and CDRH2 (position 58) and that it 
is an important determinant of the conformation of the CDRH2 
loop. A subsequent mutational study has also shown that the 
nature of this residue determines the binding characteristics of 
an antibody by influencing the conformation of the heavy chain 
CDR loops (42). The AE11F framework has, like all unmutated 
antibodies belonging to the 1GHV3 subgroup, an Arg at posi- 
tion 80, whereas all genes belonging to the 1GHV4 subgroup, 
from which the CDRH1 sequences were obtained, encode a Val 
residue at this position in their germline configurations. The 
larger, charged Arg possibly causes clashes with the IGHV4- 
derived residues in and adjacent to CDRH1, which leads to an 
improper fold and poor stability of the resulting scFv product. 

In conclusion, we demonstrate here that single amino acid 
insertions in both CDRH1 and H2 and deletions in CDRH2, 
which are highly representative of modifications that occur 
naturally in regions of the hypervariable loops known to be 
involved in antigen contact (15) during the maturation of B cell 
receptors, are well tolerated and permit production of stably 
folded proteins. This is true despite the fact that the thus 
modified loops do not fulfill the key residue requirements for 
canonical loops of the corresponding length or are of a length 
not associated with a known canonical structure (27). This 
demonstrates the plasticity of antibody V domain frameworks 
belonging to the important IGHV3 subgroup, which makes up 
a large fraction of all human antibodies (43), and its capacity to 
tolerate modifications that expand sequence and structure 
space beyond the limits set by the germline-encoded diversity. 



Fig. 



. CO spectroscopy indicated an unordered folding of the 
CDRHl-grafted clones. CD spectra of purified monomers of the 
AE11F Uhick solid line), FITC8 Uhin solid line), E3 Uhick broken line), 
and E6 {thin broken line) scFv in 50 mM sodium phosphate, pH 7.4. 



are tolerated by the scFv frameworks studied here. Further- 
more, in a previous study, we have found that single-codon 
deletions, some of which have also been shown to be functional, 
occur in antibodies belonging to the IGHV3 subgroup at or 
immediately adjacent to position 58 (12). The single-codon 
modifications of antibody sequence space we have presented 
here are in other words highly representative of changes that 
may occur naturally as a consequence of the somatic hypermu- 
tation process. 

As some of the single-codon insertions produced loop lengths 
found in antibodies belonging to the IGHV4 subgroup, we de- 
cided to investigate the possibility of using CDRH1 sequences 
originating from this subgroup to diversify the AE11F scFv. 
This approach resembles evolution through receptor revision, 
which occurs in vivo (36, 37) and has also been shown to 
provide a selection advantage in vitro (38). However, grafting of 
CDRHl loops of different lengths from the IGHV4 subgroup 
into the 1GHV3 framework used by the AE11F scFv resulted 
not only in a loss of the original antigen specificity but also in 
the acquisition of a polyreactive character, even when not hav- 
ing been put through a potentially denaturing purification 
process (39), by the thus modified scFv clones (Fig. 3). This 
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Based on the similarities with naturally occurring alterations 
of loop lengths, our results with insertions and deletions in 
CDRH1, H2, and LI of the antibody fragments used in this 
study, and work on an unrelated scFv with a three-amino acid 
insertion at the beginning of CDRHl (10), 3 our conclusion is 
that both insertions and deletions can be efficiently utilized in 
antibody engineering to expand the structural space available 
to human antibodies as long as attention is paid to key residues 
in the framework (41). As demonstrated by previous 3tudie3 on 
murine antibodies, this approach can be used for improving 
already existing specificities (17, 44). However, analogously 
with the correlation between CDR loop lengths and the antigen 
recognized (16), it is conceivable that it may also be utilized for 
the construction of antibody libraries specific for a particular 
class of antigens such as haptens, peptides, or large molecules. 
Finally, we hypothesize that introduction of novel loop lengths 
and combinations of loop lengths not encoded by the germline 
repertoire may also enable the targeting of poorly immunogenic 
or previously unrecognized antigens and epitopes as entirely 
new regions of antibody structure space are explored by this 
mode of sequence diversification. 
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